Compositions and methods for modulating HDL cholesterol and triglyceride levels

ABSTRACT

The invention features methods for treating patients having low HDL, a higher than normal triglyceride level, or a cardiovascular disease by administering compounds that modulate ABC1 expression or activity.

[0001] This application is a continuation of, and claims priority of, U.S. application Ser. No. 09/654,323, filed 1 Sep. 2000, which claims priority of U.S. Provisional Application No. 60/213,958 filed Jun. 23, 2000, and was a continuation-in-part of U.S. application Ser. No. 09/526,193, filed Mar. 15, 2000, now U.S. Pat. No. 6,617,122, which claims priority of U.S. Provisional Application No. 60/124,702, filed Mar. 15, 1999; U.S. Provisional Application No. 60/138,048, filed Jun. 8, 1999; U.S. Provisional Application No. 60/139,600, filed Jun. 17, 1999; and U.S. Provisional Application No. 60/151,977, filed Sep. 1, 1999, the disclosures of all of which are hereby incorporated by reference in their entirety.

BACKGROUND OF THE INVENTION

[0002] Low HDL cholesterol (HDL-C), or hypoalphalipoproteinemia, is a blood lipid abnormality which correlates with a high risk of cardiovascular disease (CVD), in particular coronary artery disease (CAD), but also cerebrovascular disease, coronary restenosis, and peripheral vascular disease. HDL-C levels are influenced by both environmental and genetic factors.

[0003] Epidemiological studies have consistently demonstrated that plasma HDL-C concentration is inversely related to the incidence of CAD. HDL-C levels are a strong graded and independent cardiovascular risk factor. Protective effects of an elevated HDL-C persist until 80 years of age. A low HDL-C is associated with an increased CAD risk even with normal (<5.2 mmol/l) total plasma cholesterol levels. Coronary disease risk is increased by 2% in men and 3% in women for every 1 mg/dL (0.026 mmol/l) reduction in HDL-C and in the majority of studies this relationship is statistically significant even after adjustment for other lipid and non-lipid risk factors. Decreased HDL-C levels are the most common lipoprotein abnormality seen in patients with premature CAD. Four percent of patients with premature CAD have an isolated form of decreased HDL-C levels with no other lipoprotein abnormalities while 25% have low HDL-C levels with accompanying hypertriglyceridemia.

[0004] Even in the face of other dyslipidemias or secondary factors, HDL-C levels are important predictors of CAD. In a cohort of diabetics, those with isolated low HDL-C had a 65% increased death rate compared to diabetics with normal HDL-C levels (>0.9 mmol/l). Furthermore, it has been shown that even within high risk populations, such as those with familial hypercholesterolemia, HDL-C level is an important predictor of CAD. Low HDL-C levels thus constitute a major, independent, risk for CAD.

[0005] These findings have led to increased attention to HDL-C levels as a focus for treatment, following the recommendations of the National Cholesterol Education Program. These guidelines suggest that HDL-C values below 0.9 mmol/l confer a significant risk for men and women. As such, nearly half of patients with CAD would have low HDL-C. It is therefore crucial that we obtain a better understanding of factors which contribute to this phenotype. In view of the fact that pharmacological intervention of low HDL-C levels has so far proven unsatisfactory, it is also important to understand the factors that regulate these levels in the circulation as this understanding may reveal new therapeutic targets.

[0006] Absolute levels of HDL-C may not always predict risk of CAD. In the case of CETP deficiency, individuals display an increased risk of developing CAD, despite increased HDL-C levels. What seems to be important in this case is the functional activity of the reverse cholesterol transport pathway, the process by which intracellular cholesterol is trafficked out of the cell to acceptor proteins such as ApoAI or HDL. Other important genetic determinants of HDL-C levels, and its inverse relation with CAD, may reside in the processes leading to HDL formation and intracellular cholesterol trafficking and efflux. To date, this process is poorly understood, however, and clearly not all of the components of this pathway have been identified. Thus, defects preventing proper HDL-mediated cholesterol efflux may be important predictors of CAD. Therefore it is critical to identify and understand novel genes involved in the intracellular cholesterol trafficking and efflux pathways.

[0007] HDL particles are central to the process of reverse cholesterol transport and thus to the maintenance of tissue cholesterol homeostasis. This process has multiple steps which include the binding of HDL to cell surface components, the acquisition of cholesterol by passive absorption, the esterification of this cholesterol by LCAT and the subsequent transfer of esterified cholesterol by CETP, to VLDL and chylomicron remnants for liver uptake. Each of these steps is known to impact the plasma concentration of HDL.

[0008] Changes in genes for ApoAI-CIII, lipoprotein lipase, CETP, hepatic lipase, and LCAT all contribute to determination of HDL-C levels in humans. One rare form of genetic HDL deficiency is Tangier disease (TD), diagnosed in approximately 40 patients world-wide, and associated with almost complete absence of HDL-C levels (listed in OMIM as an autosomal recessive trait (OMIM 205400)). These patients have very low HDL-C and ApoAI levels, which have been ascribed to impairment of lipid transport and hypercatabolism of nascent HDL and ApoAI, due to a delayed acquisition of lipid and resulting failure of conversion to mature HDL. TD patients accumulate cholesterol esters in several tissues, resulting in characteristic features, such as enlarged yellow tonsils, corneal opacity, hepatosplenomegaly, peripheral neuropathy, and cholesterol ester deposition in the rectal mucosa. Defective removal of cellular cholesterol and phospholipids by ApoAI as well as a marked deficiency in HDL mediated efflux of intracellular cholesterol has been demonstrated in TD fibroblasts. Even though this is a rare disorder, defining its molecular basis could identify pathways relevant for cholesterol regulation in the general population. The decreased availability of free cholesterol for efflux in the surface membranes of cells in Tangier Disease patients appears to be due to a defect in cellular lipid metabolism or trafficking. Approximately 45% of Tangier patients have signs of premature CAD, suggesting a strong link between decreased cholesterol efflux, low HDL-C and CAD. increased As cholesterol is observed in the rectal mucosa of persons with TD, the molecular mechanism responsible for TD may also regulate cholesterol adsorption from the gastrointestinal (GI) tract.

[0009] A more common form of genetic HDL deficiency occurs in patients who have low plasma HDL-C usually below the 5th percentile for age and sex (OMIM 10768), but an absence of clinical manifestations specific to Tangier disease (Marcil et al., Arterioscler. Thromb. Vasc. Biol. 19:159-169, 1999; Marcil et al., Arterioscler. Thromb. Vasc. Biol. 15:1015-1024, 1995). These patients have no obvious environmental factors associated with this lipid phenotype, and do not have severe hypertriglyceridemia nor have known causes of severe HDL deficiency (mutations in ApoAI, LCAT, or LPL deficiency) and are not diabetic. The pattern of inheritance of this condition is most consistent with a Mendelian dominant trait (OMIM 10768).

[0010] The development of drugs that regulate cholesterol metabolism has so far progressed slowly. Thus, there is a need for a better understanding of the genetic components of the cholesterol efflux pathway. Newly-discovered components can then serve as targets for drugs.

SUMMARY OF THE INVENTION

[0011] In a first aspect, the invention features a method for treating a patient diagnosed as having a lower than normal HDL-cholesterol level or a higher than normal triglyceride level. The method includes administering to the patient a compound that modulates LXR-mediated transcriptional activity. Preferably, the compound is administered to the patient with a pharmaceutically acceptable carrier. The compound may be selected, for example, from the group consisting of 24-(S),25-epoxycholesterol; 24(S)-hydroxycholesterol; 22-(R)-hydroxycholesterol; 24(R),25-epoxycholesterol; 22(R)-hydroxy-24(S),25-epoxycholesterol; 22(S)-hydroxy-24(R),25-epoxycholesterol; 24-(S),25-iminocholesterol; methyl-38-hydroxycholonate; N,N-dimethyl-3,β-hydroxycholonamide; 24(R)-hydroxycholesterol; 22(S)-hydroxycholesterol; 22(R),24(S)-dihydroxycholesterol; 25-hydroxycholesterol; 22(R)-hydroxycholesterol; 22(S)-hydroxycholesterol; 24(S),25-dihydroxycholesterol; 24(R),25-dihydroxycholesterol; 24,25-dehydrocholesterol; 25-epoxy-22(R)-hydroxycholesterol; 20(S)-hydroxycholesterol; (20R,22R)-cholest-5-ene-3β,20,22-triol; 4,4-dimethyl-5-α-cholesta-8,14,24-trien-3-β-ol; 7α-hydroxy-24(S),25-epoxycholesterol; 7β-hydroxy-24(S),25-epoxycholesterol; 7-oxo-24(S),25-expoxycholesterol; 7α-hydroxycholesterol; 7-oxocholesterol;

[0012] and desmosterol. In one preferred embodiment, the compound is an oxysterol.

[0013] In a second aspect, the invention features another method for treating a patient diagnosed as having a lower than normal HDL-cholesterol level or a higher than normal triglyceride level. This method includes administering to the patient a compound that modulates RXR-mediated transcriptional activity. RXR-modulating compounds include hetero ethylene derivatives; tricyclic retinoids; trienoic retinoids; benzocycloalkenyl-alka:di- or trienoic acid derivatives; bicyclic-aromatic compounds and their derivatives; bicyclylmethyl-aryl acid derivatives; phenyl-methyl heterocyclic compounds; tetrahydro-napthyl compounds; arylthio-tetrahydro-naphthalene derivatives and heterocyclic analogues; 2,4-pentadienoic acid derivatives; tetralin-based compounds; nonatetraenoic acid derivatives; SR11237; dexamethasone; hydroxy, epoxy, and carboxy derivatives of methoprene; bicyclic benzyl, pyridinyl, thiophene, furanyl, and pyrrole derivatives; benzofuran-acrylic acid derivatives; aryl-substituted and aryl and (3-oxo-1-propenly)-substituted benzopyran, benzothiopyran, 1,2-dihydroquinoline, and 5,6-dihydronaphthalene derivatives; vitamin D3 (1,25-dihydroxyvitamin D3) and analogs; 24-hydroxylase inhibitor; mono-or polyenic carboxylic acid derivatives; tetrahdroquinolin-2-one-6 or 7-yl and related derivatives; tetrahydronaphthalene; oxyiminoalkanoic acid derivatives; LG 100268; and LGD 1069.

[0014] In a third aspect, the invention features a method for determining whether a candidate compound modulates ABC1 expression by performing the steps of: (a) providing a nucleic acid molecule that includes an ABC1 regulatory region or promoter operably linked to a reporter gene; (b) contacting the nucleic acid molecule with the candidate compound; and (c) measuring expression of the reporter gene, wherein altered reporter gene expression, relative to a control not contacted with the compound, indicates that the candidate compound modulates ABC1 expression. In various preferred embodiments, the regulatory region includes 50 or more consecutive amino acids selected from nucleotides 5854 to 6694, 7756 to 8318, 10479 to 10825, 15214 to 16068, 21636 to 22111, 27898 to 28721, 32951 to 33743, 36065 to 36847, 39730 to 40577, 4543 to 5287, or 45081 to 55639 of SEQ ID NO: 1. In other preferred embodiments, the regulatory region 50 or more consecutive amino acids selected from nucleotides 1 to 28,707 or 29,011 to 53,228 of SEQ ID NO: 1. Preferably, the regulatory region includes a binding site for a transcription factor selected from a group consisting of LXRs, RXRs, RORs, SREBPs, and PPARs.

[0015] In a fourth aspect, the invention features a method for determining whether a person has an altered risk for developing cardiovascular disease. This method includes examining the person's ABC1 gene for polymorphisms or mutations. The presence of a polymorphism or mutation associated with cardiovascular disease indicates the person has an altered risk for developing cardiovascular disease.

[0016] In a related aspect, the invention features a method for predicting a person's response to a drug by determining whether the person has a polymorphism in an ABC1 gene that alters the person's response to the drug. Preferred polymorphisms are depicted in FIG. 4. In preferred embodiments of the fifth and sixth aspects, the polymorphism is in the 5′ regulatory region of ABC1.

[0017] In a sixth aspect, the invention features a substantially purified LXR response element comprising the nucleotide sequence AGATCANNNNAGGTCA, wherein each N is, independently, C, T, G, or A (SEQ ID NO: 231). Preferably, the LXR response element has the sequence AGATCACTTGAGGTCA (SEQ ID NO: 232). Even more preferably, the LXR response element consists essentially of the nucleotide sequence AGATCANNNNAGGTCA, wherein each N is, independently, C, T, G, or A (SEQ ID NO: 231).

[0018] In a seventh aspect, the invention features a substantially pure nucleic acid molecule that consists essentially of a region that is substantially identical to at least 50, 100, 150, 300, 500, 750, 1000, 2000, 3000, 4000, 5000 or all of the consecutive nucleotides selected from nucleotides 5854 to 6694, 7756 to 8318, 10479 to 10825, 15214 to 16068, 21636 to 22111, 27898 to 28721, 32951 to 33743, 36065 to 36847, 39730 to 40577, 45081 to 55639, 4543 to 5287, 59188 to 60306, 60689 to 63548, 63574 to 65110, 65030 to 68312, 68605 to 73375, 73395 to 74692, 75586 to 77103, 74774 to 74920, 77519 to 87679, 87651 to 94160, 96916 to 97634, 94408 to 96595, 97807 to 98989, 100369 to 107171, 107179 to 107983, 108039 to 108998, 109222 to 118212, 118612 to 123911, 124586 to 138185, 137773 to 138393, 147497 to 148051, 158490 to 159118, 123718 to 125077, 137773 to 138912, 139304 to 139699, 139351 to 146359, 146867 to 147637, 147733 to 149404, 149858 to 152699, 153064 to 153916, 153978 to 158516, 158719 to 160272, 160375 to 164458, 165279 to 169814, 164215 to 164592, 164786 to 165133, 165125 to 165429, 169882 to 170189, 170067 to 174018, 176845 to 178875, 179113 to 180606, and 181723 to 183284 of SEQ ID NO: 1. In a related aspect, the invention features a substantially pure nucleic acid molecule that has a region that is substantially identical to nucleotides 5854 to 6694, 7756 to 8318, 10479 to 10825, 15214 to 16068, 21636 to 22111, 27898 to 28721, 32951 to 33743, 36065 to 36847, 39730 to 40577, 45081 to 55639, 4543 to 5287, 59188 to 60306, 60689 to 63548, 63574 to 65110, 65030 to 68312, 68605 to 73375, 73395 to 74692, 75586 to 77103, 74774 to 74920, 77519 to 87679, 87651 to 94160, 96916 to 97634, 94408 to 96595, 97807 to 98989, 100369 to 107171, 107179 to 107983, 108039 to 108998, 109222 to 118212, 118612 to 123911, 124586 to 138185, 137773 to 138393, 147497 to 148051, 158490 to 159118, 123718 to 125077, 137773 to 138912, 139304 to 139699, 139351 to 146359, 146867 to 147637, 147733 to 149404, 149858 to 152699, 153064 to 153916, 153978 to 158516, 158719 to 160272, 160375 to 164458, 165279 to 169814, 164215 to 164592, 164786 to 165133, 165125 to 165429, 169882 to 170189, 170067 to 174018, 176845 to 178875, 179113 to 180606, or 181723 to 183284 of SEQ ID NO: 1. Preferred nucleic acid molecules have a region that is substantially identical or identical to nucleotides 1 to 28,707 of SEQ ID NO: 1 or nucleotides 29,011 to 53,228 of SEQ ID NO: 1.

[0019] In an eighth aspect, the invention features a method of treating a human having a lower than normal HDL-cholesterol level, a higher than normal triglyceride level, or a cardiovascular disease, including administering to the human an ABC1 polypeptide, or a cholesterol- or triglyceride-regulating fragment thereof, or a nucleic acid molecule encoding an ABC1 polypeptide, or a cholesterol- or triglyceride-regulating fragment thereof. In a preferred embodiment, the human has a low cholesterol or high triglyceride level relative to normal. Preferably, the ABC1 polypeptide is wild-type ABC1, or has a mutation that increases its stability or its biological activity. Preferably, the nucleic acid molecule is operably linked to a promoter and contained in an expression vector. Preferred mutations include the R_K mutation at position 219 and the V_A mutation at position 399 of ABC1. A preferred biological activity is improved regulation of cholesterol transport.

[0020] In a ninth aspect, the invention features a method of treating or preventing a lower than normal HDL-cholesterol level, a higher than normal triglyceride level, or a cardiovascular disease, including administering to an animal (e.g., a human) a compound that mimes the activity of wild-type ABC1, R219K ABC1, or V399A ABC1 or modulates the biological activity of ABC1. One preferred cardiovascular disease that can be treated using the methods of the invention is coronary artery disease. Others include cerebrovascular disease and peripheral vascular disease.

[0021] The discovery that the ABC1 gene and protein are involved in cholesterol transport that affects serum HDL levels allows the ABC1 protein and gene to be used in a variety of diagnostic tests and assays for identification of HDL-increasing, triglyceride-lowering, or CVD-inhibiting drugs. In one family of such assays, the ability of domains of the ABC1 protein to bind ATP is utilized; compounds that enhance this binding are potential HDL-increasing or triglyceride-lowering drugs. Similarly, the anion transport capabilities and membrane pore-forming functions in cell membranes can be used for drug screening.

[0022] In a tenth aspect, ABC1 expression can also serve as a diagnostic tool for a lower than normal HDL-cholesterol level, a higher than normal triglyceride level, or CVD; determination of the genetic subtyping of the ABC1 gene sequence can be used to subtype individuals or families with lower than normal HDL levels or higher than normal triglyceride levels to determine whether the lower than normal HDL or higher than normal triglyceride phenotype is related to ABC1 function. This diagnostic process can lead to the tailoring of drug treatments according to patient genotype (referred to as pharmacogenomics), including prediction of the patient's response (e.g., increased or decreased efficacy or undesired side effects upon administration of a compound or drug).

[0023] Antibodies to an ABC1 polypeptide can be used both as therapeutics and diagnostics. Antibodies are produced by immunologically challenging a B-cell-containing biological system, e.g., an animal such as a mouse, with an ABC1 polypeptide to stimulate production of anti-ABC1 protein by the B-cells, followed by isolation of the antibody from the biological system. Such antibodies can be used to measure ABC1 polypeptide in a biological sample such as serum, by contacting the sample with the antibody and then measuring immune complexes as a measure of the ABC1 polypeptide in the sample. Antibodies to ABC1 can also be used as therapeutics for the modulation of ABC1 biological activity.

[0024] Thus, in an eleventh aspect, the invention features a purified antibody that specifically binds to ABC1. In one preferred embodiment, the antibody modulates cholesterol or triglyceride levels when administered to a mammal.

[0025] In a twelfth aspect, the invention features a method for determining whether candidate compound is useful for modulating cholesterol or triglyceride levels, the method including the steps of: (a) providing an ABC1 polypeptide; (b) contacting the polypeptide with the candidate compound; and (c) measuring binding of the ABC1 polypeptide, wherein binding of the ABC1 polypeptide indicates that the candidate compound is useful for modulating cholesterol or triglyceride levels.

[0026] In a thirteenth aspect, the invention features a method for determining whether a candidate compound is useful for the treatment of a lower than normal HDL-cholesterol level, a higher than normal triglyceride level, or a cardiovascular disease. The method includes (a) providing an ABC transporter (e.g., ABC1); (b) contacting the transporter with the candidate compound; and (c) measuring ABC transporter biological activity, wherein increased ABC transporter biological activity, relative to a transporter not contacted with the compound, indicates that the candidate compound is useful for the treatment of a lower than normal HDL-cholesterol level, a higher than normal triglyceride level, or a cardiovascular disease. Preferably the ABC transporter is in a cell or a cell free assay system.

[0027] In a fourteenth aspect, the invention features a method for determining whether candidate compound is useful for modulating cholesterol or triglyceride levels. The method includes (a) providing a nucleic acid molecule comprising an ABC transporter promoter operably linked to a reporter gene; (b) contacting the nucleic acid molecule with the candidate compound; and (c) measuring expression of the reporter gene, wherein increased expression of the reporter gene, relative to a nucleic acid molecule not contacted with the compound, indicates that the candidate compound is useful for modulating cholesterol or triglyceride levels.

[0028] In a fifteenth aspect, the invention features a non-human mammal having a transgene comprising a nucleic acid molecule encoding a mutated ABC1 polypeptide. In one embodiment, the mutation is a dominant-negative mutation, such as the M_T mutation at position 1091 of ABC1.

[0029] In a sixteenth aspect, the invention features an expression vector, a cell, or a non-human mammal that includes an ABC1 nucleic acid molecule of the present invention.

[0030] In a related aspect, the invention features a cell from a non-human mammal having a transgene that includes a nucleic acid molecule of the present invention.

[0031] In an eighteenth aspect, the invention features a method for determining whether a candidate compound decreases the inhibition of a dominant-negative ABC1 polypeptide. The method includes (a) providing a cell expressing a dominant-negative ABC1 polypeptide; (b) contacting the cell with the candidate compound; and (c) measuring ABC1 biological activity of the cell, wherein an increase in the ABC1 biological activity, relative to a cell not contacted with the compound, indicates that the candidate compound decreases the inhibition of a dominant-negative ABC1 polypeptide. A preferred dominant-negative ABC1 polypeptide is M1091T ABC1.

[0032] In a nineteenth aspect, the invention features a method of determining in a subject a propensity for a disease or condition selected from the group consisting of a lower than normal HDL level, a higher than normal triglyceride level, and a cardiovascular disease. This method involves determining the presence or absence of at least one ABC1 polymorphism in the polynucleotide sequence of an ABC1 regulatory region, promoter, or coding sequence or in the amino acid sequence of an ABC1 protein in a sample obtained from the subject, wherein the presence or absence of the ABC1 polymorphism is indicative of a risk for the disease or condition. Preferably, the method also includes analyzing at least five ABC1 polymorphic sites in the polynucleotide sequence or the amino acid sequence.

[0033] In a twentieth aspect, the invention features a method for determining whether an ABC1 polymorphism is indicative of a risk in a subject for a disease or condition selected from the group consisting of a lower than normal HDL level, a higher than normal triglyceride level, and a cardiovascular disease. The method includes (a) determining whether the prevalence of the disease or condition in a first subject or set of subjects differs from the prevalence of the disease or condition in a second subject or set of subjects; (b) analyzing the polynucleotide sequence of an ABC1 regulatory region, promoter, or coding sequence or the amino acid sequence of an ABC1 protein in a sample obtained from the first subject or set of subjects and the second subject or set of subjects; and

[0034] (c) determining whether at least one ABC1 polymorphism differs between the first subject or set of subjects and the second subject or set of subjects, wherein the presence or absence of the ABC1 polymorphism is correlated with the prevalence of the disease or condition, thereby determining whether the ABC1 polymorphism is indicative of the risk. Preferably, the method further includes analyzing at least five ABC1 polymorphic sites in the polynucleotide sequence of an ABC1 regulatory region, promoter, or coding sequence or in the amino acid sequence of ABC1.

[0035] In a twenty-first aspect, the invention provides an electronic database having a plurality of sequence records of ABC1 polymorphisms correlated to records of predisposition to or prevalence of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease.

[0036] In a twenty-second aspect, the invention features a method for selecting a preferred therapy for modulating ABC1 activity or expression in a subject. This method includes (a) determining the presence or absence of at least one ABC1 polymorphism in the polynucleotide sequence of an ABC1 regulatory region, promoter, or coding sequence or in the amino acid sequence of an ABC1 protein in a sample obtained from the subject, wherein the presence or absence of the ABC1 polymorphism is indicative of the safety or efficacy of at least one therapy for modulating ABC1 expression or activity; and (b) determining a preferred therapy for modulating ABC1 expression or activity in the subject. Preferably, the method further includes analyzing at least five ABC1 polymorphic sites in the polynucleotide sequence of an ABC1 regulatory region, promoter, or coding sequence or the amino acids sequence of ABC1.

[0037] In a twenty-third aspect, the invention provides a method for determining whether a candidate compound is useful for the treatment of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. This method includes (a) providing an assay system having a measurable ABC1 biological activity; (b) contacting the assay system with the candidate compound; and (c) measuring ABC1 biological activity or ABC1 phosphorylation. Modulation of ABC1 biological activity or ABC1 phosphorylation in this assay system, relative to the ABC1 biological activity or ABC1 phosphorylation in a corresponding control assay system not contacted with the candidate compound, indicates that the candidate compound is useful for the treatment of the disease or condition. In preferred embodiments, the assay system is a cell based system or a cell free system. Preferably, the candidate compound modulates both ABC1 protein phosphorylation and ABC1 activity.

[0038] In a twenty-fourth aspect, the invention provides a method for identifying a compound to be tested for an ability to ameliorate a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. This method includes

[0039] (a) contacting a subject or cell with a candidate compound; (b) measuring ABC1 expression, activity, or protein phosphorylation in the subject or cell. Altered ABC1 expression, activity, or protein phosphorylation in this subject or cell; relative to the ABC1 expression, activity, or protein phosphorylation in a corresponding control subject or cell not contacted with the candidate compound; identifies the candidate compound as a compound to be tested for an ability to ameliorate the disease or condition. Preferably, the candidate compound modulates both ABC1 protein phosphorylation and the ABC1 activity.

[0040] In a twenty-fifth aspect, the invention provides a method for determining whether a candidate compound is useful for modulating a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. The method includes (a) providing a cell expressing an ABC1 gene or a fragment thereof;

[0041] (b) contacting the cell with the candidate compound; and (c) measuring ABC1 activity of the cell. Altered ABC1 activity in this cell, relative to the ABC1 activity in a corresponding control cell not contacted with the compound, indicates that the candidate compound is useful for modulating the disease or condition.

[0042] In a twenty-six aspect, the invention provides a method for determining whether a candidate compound is useful for modulating a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. This method includes (a) contacting a cell expressing an ABC1 protein with the candidate compound; and (b) measuring the phosphorylation of the ABC1 protein. Altered ABC1 protein phosphorylation in this cell, relative to the ABC1 protein phosphorylation in a corresponding control cell not contacted with the candidate compound, indicates that the is useful for modulating the disease or condition.

[0043] In a twenty-seventh aspect, the invention provides a compound useful for the treatment of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. The compound modulates ABC1 biological activity, and is identified by the steps of (a) providing an assay system having a measurable ABC1 biological activity; (b) contacting the assay system with the compound; and (c) measuring ABC1 biological activity, wherein modulation of ABC1 biological activity, relative to the ABC1 biological activity in a corresponding control assay system not contacted with the compound, indicates that the compound is useful for the treatment of the disease or condition.

[0044] In a twenty-eighth aspect, the invention provides a compound useful for the treatment of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. The compound induces a change in ABC1 biological activity that mimics the change in ABC1 biological activity induced by the R219K ABC1 mutation.

[0045] In a twenty-ninth aspect, the invention provides a compound useful for the treatment of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. The compound binds or interacts with residue R219 of ABC1, thereby mimicking the change in ABC1 activity induced by the R219K ABC1 mutation.

[0046] In a thirtieth aspect, the invention provides a compound useful for the treatment of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. The compound induces a change in ABC1 biological activity that mimics the change in ABC1 biological activity induced by the V339A ABC1 mutation.

[0047] In a thirty-first aspect, the invention provides a compound useful for the treatment of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. The compound binds or interacts with residue V399 of ABC1, thereby mimicking the change in ABC1 activity induced by the V399A ABC1 mutation.

[0048] In a thirty-second aspect, the invention provides a compound that modulates ABC1 activity and binds or interacts with an amino acid of ABC1, wherein the amino acid is a residue selected from amino acids 119 to 319 of ABC1 (SEQ ID NO: 5) or amino acids 299 to 499 of ABC1 (SEQ ID NO: 5).

[0049] In a thirty-second aspect, the invention provides a method for determining whether a candidate compound is useful for the treatment a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. This method involves (a) providing an assay system having a measurable LXR biological activity; (b) contacting the assay system with the candidate compound; and

[0050] (c) measuring LXR biological activity, wherein modulation of LXR biological activity, relative to the LXR biological activity in a corresponding control assay system not contacted with the candidate compound, indicates that the candidate compound is useful for the treatment of the disease or condition.

[0051] In a thirty-third aspect, the invention provides method for determining whether a candidate compound is useful for modulating ABC1 biological activity. This method involves (a) providing an assay system having a measurable LXR biological activity; (b) contacting the assay system with the candidate compound; and (c) measuring LXR biological activity, wherein modulation of LXR biological activity, relative to the LXR biological activity in a corresponding control assay system not contacted with the candidate compound, indicates that the candidate compound is useful for modulating ABC1 biological activity. Preferably, the LXR biological activity is modulation of ABC1 expression.

[0052] In a thirty-fourth aspect, the invention provides method for identifying a compound to be tested for an ability to modulate ABC1 biological activity. This method involves (a) contacting a subject or cell with a candidate compound;

[0053] (b) assaying the activity of the LXR gene product in the subject or cell; wherein modulation of the activity, relative to the activity in a corresponding control subject or cell not contacted with the candidate compound, identifies the candidate compound as a compound to be tested for an ability to modulate the biological activity of ABC1.

[0054] In a thirty-fifth aspect, the invention provides the use of an LXR gene product in an assay to identify compounds useful for the treatment of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease.

[0055] In a thirty-sixth aspect, the invention features the use of a compound that modulates the activity or expression of an LXR gene product for the treatment of a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease.

[0056] In a thirty-seventh aspect, the invention provides a method for identifying a compound to be tested for an ability to treat a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. This method involves

[0057] (a) providing an assay system having a measurable LXR biological activity;

[0058] (b) contacting the assay system with the candidate compound; and (c) measuring LXR biological activity, wherein modulation of the LXR biological activity, relative to the LXR biological activity in a corresponding control assay system not contacted with the candidate compound, identifies the candidate compound as a compound to be tested for an ability to treat the disease or condition.

[0059] In a thirty-eight aspect, the invention provides a method for screening a candidate LXR agonist for the ability to treat a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. This method involves

[0060] (a) contacting the cell with the candidate LXR agonist; and (b) measuring cholesterol efflux activity of the cell, wherein an increase in the cholesterol efflux activity in the cell, relative to the cholesterol efflux in a corresponding control cell not contacted with the candidate LXR agonist, indicates that the candidate LXR agonist is useful for treating the disease or condition.

[0061] In a thirty-ninth aspect, the invention provides a method for screening a candidate LXR modulating compound for the ability to treat a disease or condition selected from the group consisting of a lower than normal HDL cholesterol level, a higher than normal triglyceride level, and a cardiovascular disease. This method involves (a) contacting a cell with the candidate LXR modulating compound; and

[0062] (b) measuring ABC1 biological activity of the cell; wherein an increase in ABC1 biological activity in the cell, relative to the ABC1 biological activity in a corresponding control cell not contacted with the LXR modulating compound, indicates that the LXR modulating compound is useful for treating the disease or condition.

[0063] In another aspect, the invention provides a method for determining whether a candidate compound is useful for modulating triglyceride levels. The method involves (a) providing a cell comprising an ABC1 polypeptide comprising amino acids 1 to 60 of SEQ ID NO: 5; (b) contacting the cell with the candidate compound; and (c) measuring the half-life of the ABC1 polypeptide,

[0064] wherein an increase in said half-life, relative to the half-life in a corresponding control cell not contacted with the compound, indicates that the candidate compound is useful for modulating triglyceride levels.

[0065] In a related aspect, the invention features method for determining whether a candidate compound mimics ABC1 biological activity. The method includes (a) providing a cell that is not expressing an ABC1 polypeptide; (b) contacting the cell with the candidate compound; and (c) measuring ABC1 biological activity of the cell, wherein altered ABC1 biological activity, relative to a corresponding control cell not contacted with the compound, indicates that the candidate compound modulates ABC1 biological activity. Preferably, the cell has an ABC1 null mutation. In one preferred embodiment, the cell is in a mouse or a chicken (e.g., a WHAM chicken) in which its ABC1 gene has been mutated.

[0066] In a preferred embodiment of the screening methods of the present invention, the cell is in an animal. The preferred biological activity is transport of cholesterol (e.g., HDL cholesterol or LDL cholesterol) or interleukin-1, or is binding or hydrolysis of ATP by the ABC1 polypeptide. Preferably, the ABC1 polypeptide used in the screening methods includes amino acids 1-60 of SEQ ID NO: 5. Alternatively, the ABC1 polypeptide can include a region encoded by a nucleotide sequence that hybridizes under high stringency conditions to nucleotides 75 to 254 of SEQ ID NO: 6. Preferably, the subject is a human. Preferably, the cell or assay system has an exogenously supplied copy of an LXRE selected from the group consisting of SEQ ID NO: 94, SEQ ID NO: 92, and the LXRE consensus motif at nucleotide −7670 of the 3′ end of intron 1. For various methods of the invention, a preferred LXR biological activity is modulation of ABC1 expression. A preferred LXR gene product is an ABC1 nucleic acid molecule or protein.

[0067] It is also contemplated that additional sequence of the ABC1 regulatory regions may be determined by sequencing the rest of the 4I8, 31J20, 47O19, or 179G21 Research Genetics RPCI-11 BACs using the methods described herein. Substantially pure nucleic acids containing regions substantially identical to at least 50, 100, 150, 300, 500, 750, 1000, 2000, 3000, 4000, 5000 consecutive nucleotides of these regions may be used in the methods of the present invention.

[0068] By “polypeptide” is meant any chain of more than two amino acids, regardless of post-translational modification such as glycosylation or phosphorylation.

[0069] By “reporter gene” is meant any gene which encodes a product whose expression is detectable and/or quantifiable by physical, immunological, chemical, biochemical, or biological assays. A reporter gene product may, for example, have one of the following attributes, without restriction: a specific nucleic acid/chip hybridization pattern, fluorescence (e.g., green fluorescent protein), enzymatic activity (e.g., lacZ/β-galactosidase, luciferase, chloramphenicol acetyltransferase), toxicity (e.g., ricin A), or an ability to be specifically bound by a second molecule (e.g., biotin or a detectably labeled antibody). It is understood that any engineered variants of reporter genes, which are readily available to one skilled in the art, are also included, without restriction, in the foregoing definition.

[0070] By “operably linked” is meant that a gene and a regulatory sequence are connected in such a way as to permit expression of the gene product under the control of the regulatory sequence. A promoter may also be operably linked to a gene such that expression of the gene product is under control of the promoter.

[0071] By “regulatory region” is meant a region that, when operably linked to a promoter and a gene (e.g., a reporter gene), is capable of modulating the expression of the gene from the promoter. Regulatory regions include, for example, nuclear hormone transcription factor binding sites such as those described herein and may be found in intronic sequence.

[0072] By “promoter” is meant a minimal sequence sufficient to direct transcription of an operably-linked gene.

[0073] By “substantially identical” is meant a polypeptide or nucleic acid exhibiting at least 50%, preferably 85%, more preferably 90%, and most preferably 95% identity to a reference amino acid or nucleic acid sequence. For polypeptides, the length of comparison sequences will generally be at least 16 amino acids, preferably at least 20 amino acids, more preferably at least 25 amino acids, and most preferably 35 amino acids. For nucleic acids, the length of comparison sequences will generally be at least 50 nucleotides, preferably at least 60 nucleotides, more preferably at least 75 nucleotides, and most preferably 110 nucleotides. One sequence may include additions or deletions (i.e., gaps) of 20% or less when compared to the second sequence. Optimal alignment of sequences may be conducted, for example, by the methods of Gish and States (Nature Genet. 3:266-272, 1993), Altshul et al. (J. Mol. Biol. 215:403-410, 1990), Madden et al. (Meth. Enzymol. 266:131-141, 1996), Althsul et al (Nucleic Acids Res. 25:3389-3402, 1997), or Zhang et al (Genome Res. 7:649-656,1997).

[0074] Sequence identity is typically measured using sequence analysis software with the default parameters specified therein (e.g., Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705). This software program matches similar sequences by assigning degrees of homology to various substitutions, deletions, and other modifications. Conservative substitutions typically include substitutions within the following groups: glycine, alanine, valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine.

[0075] By “substantially pure nucleic acid” is meant nucleic acid that is free of the genes which, in the naturally-occurring genome of the organism from which the nucleic acid of the invention is derived, flank the nucleic acid. The term therefore includes, for example, a recombinant nucleic acid that is incorporated into a vector; into an autonomously replicating plasmid or virus; into the genomic nucleic acid of a prokaryote or a eukaryote cell; or that exists as a separate molecule (e.g., a cDNA or a genomic or cDNA fragment produced by PCR or restriction endonuclease digestion) independent of other sequences. It also includes a recombinant nucleic acid that is part of a hybrid gene encoding additional polypeptide sequence.

[0076] By “high stringency conditions” is meant hybridization in 2×SSC at 40° C. with a DNA probe length of at least 40 nucleotides. For other definitions of high stringency conditions, see F. Ausubel et al., Current Protocols in Molecular Biology, pp. 6.3.1-6.3.6, John Wiley & Sons, New York, N.Y., 1994, hereby incorporated by reference.

[0077] By “modulates” is meant increase or decrease. Preferably, a compound that modulates LXR-mediated transcription, RXR-mediated transcription, ABC1 gene expression, HDL-C levels, or triglyceride levels does so by at least 5%, more preferably by at least 10%, and most preferably by at least 25% or even at least 50%.

[0078] By “purified antibody” is meant antibody which is at least 60%, by weight, free from proteins and naturally occurring organic molecules with which it is naturally associated. Preferably, the preparation is at least 75%, more preferably 90%, and most preferably at least 99%, by weight, antibody. A purified antibody may be obtained, for example, by affinity chromatography using recombinantly-produced protein or conserved motif peptides and standard techniques.

[0079] By “specifically binds” is meant an antibody that recognizes and binds to, for example, a human ABC1 polypeptide but does not substantially recognize and bind to other non-ABC1 molecules in a sample, e.g., a biological sample, that naturally includes protein. A preferred antibody binds to the ABC1 polypeptide sequence of FIG. 2A (SEQ ID NO: 5).

[0080] By “polymorphism” is meant that a nucleotide or nucleotide region is characterized as occurring in several different forms. A “mutation” is a form of a polymorphism in which the expression level, stability, function, or biological activity of the encoded protein is substantially altered.

[0081] By “LXR” is meant nuclear receptors LXRα and LXRβ. Preferred LXRs include human LXRα (GenBank accession no. Q13133) and human LXRβ (GenBank accession no. P55055)(see Apfel et al., Mol. Cell. Biol. 14:7025-7035, 1994; Willy et al., Genes Dev. 9:1033-1045, 1995; and Song et al., Proc. Natl. Acad. Sci. USA 91:10809-10813, 1995, each of which is hereby incorporated by reference).

[0082] By “RXR” is meant nuclear receptors RXRα, RXRβ, and RXRγ. Preferred RXRs include human RXRα (GenBank accession no. Q13133), human RXRβ (GenBank accession no. S37781), and human RXRγ (GenBank accession no. Q13133).

[0083] By “ABC transporter” or “ABC polypeptide” is meant any transporter that hydrolyzes ATP and transports a substance across a membrane. Preferably, an ABC transporter polypeptide includes an ATP Binding Cassette and a transmembrane region. Examples of ABC transporters include, but are not limited to, ABC1, ABC2, ABCR, and ABC8.

[0084] By “ABC1 polypeptide” is meant a polypeptide having substantial identity to an ABC1 polypeptide having the amino acid sequence of SEQ ID NO: 5.

[0085] By “ABC biological activity” or “ABC1 biological activity” is meant hydrolysis or binding of ATP, transport of a compound (e.g., cholesterol, interleukin-1) or ion across a membrane, or regulation of cholesterol or phospholipid levels (e.g., either by increasing or decreasing HDL-cholesterol or LDL-cholesterol levels).

[0086] The invention provides methods for treating patients having low HDL-C and/or higher than normal triglyceride levels by administering compounds that modulate ABC1 biological activity or expression. For example, the compounds may modulate the transcriptional activity of LXR/RXR heterodimers. Many compounds that modulate LXR transcriptional activity or RXR transcriptional activity are known in the art. Preferred compounds of the invention are oxysterols; additional compounds are described herein.

[0087] The invention also provides screening procedures for identifying therapeutic compounds (cholesterol-modulating, triglyceride-modulating, or anti-CVD pharmaceuticals) which can be used in human patients. Compounds that modulate ABC1 biological activity or expression are considered useful in the invention, as are compounds that modulate ABC concentration, protein stability, regulated catabolism, or its ability to bind other proteins or factors. In general, the screening methods of the invention involve screening any number of compounds for therapeutically active agents by employing any number of in vitro or in vivo experimental systems. Exemplary methods useful for the identification of such compounds are detailed below.

[0088] The methods of the invention simplify the evaluation, identification and development of active agents for the treatment and prevention of low HDL, higher than normal triglyceride levels, and CVD. In general, the screening methods provide a facile means for selecting natural product extracts or compounds of interest from a large population which are further evaluated and condensed to a few active and selective materials. Constituents of this pool are then purified and evaluated in the methods of the invention to determine their HDL-raising, triglyceride-lowering, anti-CVD activities, or a combination thereof.

[0089] Other features and advantages of the invention will be apparent from the following description of the preferred embodiments thereof.

BRIEF DESCRIPTION OF THE DRAWINGS

[0090]FIG. 1 shows the genomic sequence of human ABC1, including exons 1-50 (SEQ ID NO: 1). Capital letters denote exonic sequence, lower case letters denote 5′ regulatory sequence or intronic sequence. “Z” denotes any nucleotide or other, including no nucleotide. The numbering used herein for the nucleotides in SEQ ID No:1 assumes that no nucleotide is present at the positions denoted “z;” however, it would be readily apparent to one skilled in the art that the numbering of the nucleotides in these sequences would change if a nucleotide is present at some or all of the positions denoted “z.” “K” denotes nucleotides G or T; “Y” denotes nucleotides C or T; “R” denotes nucleotides A or G; “M” denotes nucleotides A or C; “S” denotes nucleotides C or G; “H” denotes nucleotides A, C, or T; “B” denotes nucleotides C, G, or T. Because of the identification of an ABC1 exon upstream of exon 0 that we previously disclosed, the numbering used herein to refer to ABC1 exons has been increased by one compared to our previously used numbering (U.S. Ser. No. 09/526,193; U.S. Ser. No. 60/124,702; U.S. Ser. No. 60/138,048; U.S. Ser. No. 60/139,600; U.S. Ser. No. 60/151,977). For example, the previously described exon 0 is referred to herein as exon 1.

[0091]FIG. 2A is the amino acid sequence of the human ABC1 protein (SEQ ID NO: 5). FIG. 2B is the nucleotide sequence of the human ABC1 cDNA (SEQ ID NO: 6).

[0092]FIG. 3 is a summary of locations of consensus transcription factor binding sites in the human ABC1 5′ regulatory sequence. The abbreviations are as follows: PPRE=peroxisome proliferator-activated receptor. SREBP=steroid response element-binding protein site. ROR=RAR-related orphan receptor. The numbering used herein for the locations of the transcription factor binding sites assumes that no nucleotide is present at the positions denoted “z” in SEQ ID NO: 1. For the polymorphisms in the promoter region, the numbering is based on the first base of the promoter as nucleotide number −1. For exon 1, the numbering is based on the first base of exon 1 as nucleotide number +1. For the 5′ end intron 1, the numbering is based on the first position in intron 1 as +1. For the 3′ end of intron 1, the numbering is based on the first base ′5 to the start of exon 2 as nucleotide number −1.

[0093]FIG. 4 is a table summarizing polymorphisms in the genomic ABC1 sequences.

[0094]FIGS. 5A and 5B are bar graphs showing the percent of heterozygotes or unaffected family members with HDL (FIG. 5A) and triglycerides (TG) (FIG. 5B) within a given range of percentiles for age and sex, based on the LRC criteria (Heiss et al., Circulation 62:IV-116-IV-136, 1980). A broad distribution of HDL levels was seen in the heterozygotes, extending up to the 31^(st) percentile for age and sex. Although there is overlap in the distribution of triglycerides between heterozygotes and unaffected family members, a larger portion of heterozygotes have triglyceride levels greater than the 80^(th) percentile for age and sex.

[0095]FIG. 6 is a table characterizing TD patients, ABC1 heterozygotes, and unaffected family members.

[0096]FIG. 7 is a table summarizing the incidence of CAD in ABC1 heterozygotes.

[0097]FIG. 8 is a graph showing the average HDL levels in heterozygotes for each mutation versus the efflux levels measured in a heterozygous carrier of each mutation. The HDL levels are expressed as the percentage of the mean HDL level in the unaffected members of that family. The efflux levels are highly correlated with the levels of HDL cholesterol and are associated with 82% of the variation in HDL cholesterol levels.

[0098]FIG. 9 is a table summarizing the HDL levels and presence or absence of CAD in ABC1 heterozygotes. In the R2144X and R909X ABC1 mutations, the codon encoding Arg2133 or Arg909 is mutated to a STOP codon resulting in truncation of the encoded protein. In the “Del E,D 1893,94” mutation, the codons encoding Glu1893 and Asp1894 are deleted. The “invs25+1G-->C” mutation converts the first nucleotide of intron 25 from a “G” to a “C,” removing a splice site. For the “del C6825-->2145X” mutation, the deletion of C6825 in the nucleotide sequence is a frame-shift mutation that results in a STOP codon at the codon corresponding to amino acid 2145 of the encoded protein. For the “CTC69524TT-->2203X” mutation, “CTC” is replaced by “TT” in the nucleotide sequence, resulting in the conversion of the codon encoding amino acid 2203 to a stop codon.

[0099]FIG. 10 is a table comparing the mean lipid levels in unaffected family members and ABC1 heterozygotes with either missense or sever mutations.

[0100]FIG. 11 is a schematic diagram of the ABC1 protein, illustrating the location of the mutations and the presence or absence of CAD in carriers of the mutations. The number (n) of heterozygotes who are 40 years or older and may have developed CAD are listed.

[0101]FIGS. 12A and 12B are pedigrees of two FHA kindreds, FHA3 and FHA1, respectively (Marcil et al., Lancet 354:1341-1346, 1999). Males are denoted by square symbols, females by circles. Individuals heterozygous for mutations are given half-shaded symbols, with the probands indicated by arrows. A diagonal line indicates a deceased individual. The youngest individuals have HDL cholesterol at higher percentile ranges than those in the older generations.

[0102]FIG. 13 is a bar graph showing the percentage of individuals less than 30 years of age and from 30 to less than 70 years of age with HDL cholesterol levels in a given percentile range. Younger individuals have a far broader distribution of HDL cholesterol levels, clearly indicating that the impact of ABC1 on HDL levels is influenced by age.

[0103]FIG. 14 is a table summarizing HDL and TG levels in different age groups for ABC1 heterozygotes and unaffected family members.

[0104]FIGS. 15A and 15B are graphs showing the mean HDL level in heterozygous males (FIG. 15A) and females (FIG. 15B) in 10 year age groups (plotted at the half-way point) compared to the 10^(th) percentile distribution in the LRC population (Heiss et al., supra). Error bars represent the standard deviation of each mean. The number of individuals in each group is shown under each data point. Beyond the age of 30, mean HDL levels in heterozygotes fall much lower than the 10^(th) percentile distribution; in contrast, mean HDL cholesterol levels in the heterozygotes less than 30 years old more closely approximate the 10^(th) percentile distribution.

[0105]FIGS. 16A and 16B are graphs showing the mean HDL (FIG. 16A) and triglyceride levels (FIG. 16B) in heterozygotes and unaffected family members falling within each tertile of BMI. The tertiles of BMI correspond to the following values: (1) BMI<21.4; (2) 21.4<BMI<25.1; (3) BMI>25.1.

[0106]FIG. 17 is a table showing the oligonucleotides and reaction conditions used for RFLP screening of ABC1 polymorphisms.

[0107]FIG. 18 is a picture of a gel showing RFLP genotyping of the R219K variant. The 177 base pair PCR product is not digested for the A allele, whereas the B allele is digested producing fragments of 107 and 70 base pairs.

[0108]FIG. 19 is a table showing the allele frequencies of polymorphisms in the ABC1 gene.

[0109]FIG. 20 is a table comparing MSD, MOD, and frequency of coronary events in R219K ABC1 variant carriers compared to controls.

[0110]FIG. 21 is a graph showing the event-free survival curves for carriers (AB+BB) and non-carriers (AA) of the R219K ABC1 variant. Carriers of the variant have a 29% increased event-free survival over the two years of the trial, compared with non-carriers.

[0111]FIG. 22 is a table showing the baseline demographics and lipid levels in the Regression Growth Evaluation Statin Study (REGRESS) cohort by R219K ABC1 genotype.

[0112]FIG. 23 is a table showing the lipid levels and CAD above and below the median age in R219KABC1 carriers and controls.

[0113]FIG. 24 is a bar graph showing the percent difference in HDL cholesterol levels between those greater and less than the median age (56.7 years) for each R219K genotype.

[0114]FIGS. 25A and 25B are graphs showing the correlations of HDL cholesterol (FIG. 25A) and efflux (FIG. 25B) with age, by R219K genotype.

[0115]FIGS. 26A and 26B are graphs showing the change in MSD (FIG. 26A) and MOD (FIG. 26B) by median age in carriers (AB+BB) and non-carriers (M) of the R219K ABC1 variant.

[0116]FIG. 27 is a table showing the ethnic distribution of the R219K ABC1 variant.

DETAILED DESCRIPTION OF THE INVENTION

[0117] We have previously discovered that the human ABC1 (also known as ABCA1) genomic region contains consensus binding sites for transcription factors such as LXRs, RXRs, PPARs, SREBPs, and RORs. In the present invention, we report the sequence of additional regions of the ABC1 regulatory region which also contain consensus binding sites for transcription factors. We also discovered that heterozygotes for ABC1 mutations have age-modulated decreases in HDL, increases in triglyceride levels, and significantly increased risk for CAD. Furthermore, this phenotype was highly correlated with efflux, clearly demonstrating that impairment of reverse cholesterol transport is associated with decreased plasma HDL cholesterol, increased triglyceride levels, and increased atherogenesis. Accordingly, the present invention features screening methods to identify therapies that increase ABC1 function, resulting in increased plasma HDL cholesterol, decreased triglyceride levels, protection against atherosclerosis, or a combination of these effects.

[0118] Genes play a significant role influencing HDL levels. Tangier disease (TD) was the first reported genetic HDL deficiency. Until recently, the molecular basis for TD was unknown, but now mutations in ABC1 have been identified in TD patients (described below). For example, we have identified two additional probands and their families, and confirmed linkage and refined the locus to a limited genomic region. Mutations in the ABC1 gene accounting for all four alleles in these two families were detected. A more frequent cause of low HDL levels is a distinct disorder, familial HDL deficiency (FHA). On the basis of independent linkage, meiotic recombinants and disease associated haplotypes, FHA was localized to a small genomic region encompassing the ABC1 gene. A mutation in a conserved residue in ABC1 segregated with FHA. Antisense reduction of the ABC1 transcript in fibroblasts was associated with a significant decrease in cholesterol efflux.

[0119] Cholesterol is normally assembled with intracellular lipids and secreted, but in TD the process is diverted and cholesterol is degraded in lysosomes. This disturbance in intracellular trafficking of cholesterol results in an increase in intracellular cholesterol ester accumulation associated with morphological changes of lysosomes and the Golgi apparatus and cholesteryl ester storage in histiocytes, Schwann cells, smooth muscle cells, mast cells and fibroblasts.

[0120] The clinical and biochemical heterogeneity in patients with TD has led to the possibility that genetic heterogeneity may also underlie this disorder. Considering this, we initially performed linkage analysis on these two families of different ancestries (TD-1 is Dutch, TD-2 is British; Frohlich et al., Clin. Invest. Med. 10:377-382, 1987) and confirmed that the genetic mutations underlying TD in these families were localized to the same 9q31 region, to which a large family with TD had been assigned (Rust et al., Nature Genetics 20:96-98, 1998). Detailed haplotype analysis, together with the construction of a physical map, refined the localization of this gene. Mutations in the ABC1 gene were found in TD.

[0121] FHA is much more common than TD, although its precise frequency is not known. While TD has been described to date in only 40 families, we have identified more than 40 FHA families in the Netherlands and Quebec alone. After initial suggestions of linkage to 9q31, thirteen polymorphic markers spanning approximately 10 cM in this region were typed and demonstrated the highest LOD score at D9S277. Analysis of the homozygosity of markers in the TD-2 proband, who was expected to be homozygous for markers close to TD due to his parents' consanguinity, placed the TD gene distal to D95127. Combined genetic data from TD and FHA families pointed to the same genomic segment spanning approximately 1,000 kb between D9S127 and D9S1866. The ABC1 transporter gene was contained within the minimal genomic region. RT-PCR analysis in one family demonstrated a deletion of leucine at residue 693 (693) in the first transmembrane domain of ABC1, which segregated with the phenotype of HDL deficiency in this family.

[0122] ABC1 is part of the ATP-binding cassette (ABC transporter) superfamily, which is involved in energy-dependent transport of a wide variety of substrates across membranes (Dean et al., Curr. Opin. Gen. Dev. 5:779-785, 1995). These proteins have characteristic motifs conserved throughout evolution which distinguish this class of proteins from other ATP binding proteins. In humans these genes essentially encode two ATP binding segments and two transmembrane domains (Dean et al., Curr. Opin. Gen. Dev. 5:779-785, 1995). We have now shown that the ABC1 transporter is crucial for intracellular cholesterol transport.

[0123] We have demonstrated that reduction of the ABC1 transcript using oligonucleotide antisense approaches results in decreased efflux, clearly demonstrating the link between alterations in this gene and its functional effects. TD and FHA now join the growing list of genetic diseases due to defects in the ABC group of proteins including cystic fibrosis (Zielenski, et al., Annu. Rev. Genet. 29:777-807, 1995), adrenoleukodystrophy (Mosser et al., Nature 361: 726-730, 1993), Zellweger syndrome (Gärtner et al., Nat. Genet. 1:23, 1992), progressive familial intrahepatic cholestatis (Bull et al., Nat. Genet. 18:219-224, 1998), and different eye disorders including Stargardt disease (Allikmets et al., Nat. Genet. 15:236-246, 1997), autosomal recessive retinitis pigmentosa (Allikmets et al., Science 277:1805-1807, 1997), and cone-rod dystrophy (Cremers et al., Hum. Mol. Genet. 7:355-362, 1998).

[0124] Patients with TD have been distinguished from patients with FHA on the basis that Tangier disease was an autosomal recessive disorder (OMIM 20540) while FHA is inherited as an autosomal dominant trait (OMIM 10768). Furthermore, patients with TD have obvious evidence for intracellular cholesterol accumulation which is not seen in FHA patients. It is now evident that heterozygotes for TD do have reduced HDL levels and that the same mechanisms underlie the HDL deficiency and cholesterol efflux defects seen in heterozygotes for TD as well as FHA. Furthermore, the more severe phenotype in TD represents loss of function from both alleles of the ABC1 gene.

[0125] ABC1 is activated by protein kinases, presumably via phosphorylation, which also provides one explanation for the essential role of activation of protein kinase C in promoting cholesterol efflux (Drobnick et al., Arterioscler. Thromb. Vasc. Biol. 15: 1369-1377, 1995). Brefeldin, which inhibits trafficking between the endoplasmic reticulum and the Golgi, significantly inhibits cholesterol efflux, essentially reproducing the effect of mutations in ABC1, presumably through the inhibition of ABC1 biological activity. This finding has significance for the understanding of mechanisms leading to premature atherosclerosis. TD homozygotes develop premature coronary artery disease, as seen in the proband of TD-1 (III-01) who had evidence for coronary artery disease at 38 years. This is particular noteworthy as TD patients, in addition to exhibiting significantly reduced HDL, also have low LDL cholesterol, and yet they develop atherosclerosis despite this. This highlights the importance of HDL intracellular transport as an important mechanism in atherogenesis. There is significant evidence that heterozygotes for TD are also at increased risk for premature vascular disease (Schaefer et al., Ann. Int. Med. 93:261-266, 1980; Serfaty-Lacrosniere et al., Atherosclerosis 107:85-98, 1994). There is also preliminary evidence for premature atherosclerosis in some probands with FHA (e.g., the proband in FHA-2 (111-01) had a coronary artery bypass graft at 46 years while the proband in FHA-3 had evidence for CAD around 50 years of age. The TD-1 proband had more severe efflux deficiency than the TD-2 proband. Interestingly, the TD-2 proband had no evidence for CAD by 62 when he died of unrelated causes, providing preliminary evidence for a relationship between the degree of cholesterol efflux (mediated in part by the nature of the mutation) and the likelihood of atherosclerosis.

[0126] The ABC1 gene plays a crucial role in cholesterol transport and, in particular, intracellular cholesterol trafficking in monocytes and fibroblasts. It also appears to play a significant role in other tissues such as the nervous system, GI tract, and the cornea. Completely defective intracellular cholesterol transport results in peripheral neuropathy, corneal opacities, and deposition of cholesterol esters in the rectal mucosa.

[0127] HDL deficiency is heterogeneous in nature. The delineation of the genetic basis of TD and FHA underlies the importance of this particular pathway in intracellular cholesterol transport, and its role in the pathogenesis of atherosclerosis. Unraveling of the molecular basis for TD and FHA defines a key step in a poorly defined pathway of cholesterol efflux from cells and could lead to new approaches to treatment of patients with HDL deficiency in the general population.

[0128] HDL has been implicated in numerous other biological processes, including but not limited to: prevention of lipoprotein oxidation; absorption of endotoxins; protection against Trypanosoma brucei infection; modulation of endothelial cells; and prevention of platelet aggregation (see Genest et al., J. Invest. Med. 47: 3142, 1999, hereby incorporated by reference). Any compound that modulates HDL levels may be useful in modulating one or more of the foregoing processes. Our previous discovery that ABC1 functions to regulate HDL levels links, for the first time, ABC1 with the foregoing processes.

[0129] With the identification of the ABC1 protein as a key initiator of the efflux pathway, it has now been possible to directly examine the relationship between efflux, HDL, triglyceride level, and CAD. We have characterized the phenotypes of heterozygotes for several mutations in the ABC1 gene in a large cohort where diagnosis has been made by mutation identification. Furthermore, the phenotype of the heterozygotes was compared to that of unaffected family members, enabling the results to be controlled, at least in part, for other genetic and environmental influences. In contrast, prior studies in obligate heterozygotes have been limited to small numbers, often within a single family, and thus restricted in the ability to analyze the phenotypic expression with multiple mutations over a range of ages.

[0130] A cohort of 77 individuals heterozygous for multiple mutations in the ABC1 gene were identified, enabling the characterization of 13 ABC1 mutations in 11 families (5 TD, 6 FHA). The ABC1 heterozygotes have an approximate 50% decrease in HDL cholesterol and apoAI, and a mild but significant decrease in apoAII. In addition, ABC1 heterozygotes have increased triglycerides, but in contrast to TD patients, have no significant change in total or LDL cholesterol. The changes in HDL, apoAI, and triglycerides were gene-dose dependent, suggesting that they are directly related to ABC1 function. Furthermore, heterozygotes have an over three-fold increased risk of developing CAD, and younger average age-of-onset compared to unaffected individuals. Further, the heterozygotes with the most severe deficiency in efflux had a higher frequency and greater severity of CAD. Interestingly, the severity of the phenotype observed in the heterozygotes appeared to be mutation-dependent, but there was no obvious relationship between the site of mutation and the phenotype. There was a trend toward lower HDL in carriers of severe mutations that caused truncations or null alleles than in carriers of missense mutations. One notable exception is the M1091T missense mutation which had the most severe phenotype, with marked reductions in HDL cholesterol and efflux in affected family members, suggesting that this mutation may act in a dominant-negative fashion, down-regulating the function of the wild-type allele. Another interesting finding is the small cluster of mutations at the very C-terminal region of the protein, which suggests that this region is critical for ABC1 function.

[0131] The severe HDL deficiency in ABC1 heterozygotes suggests that residual cholesterol efflux is the major determinant of HDL cholesterol levels. Here we demonstrated a strong correlation between cholesterol efflux and HDL cholesterol levels. From the regression equation of mean HDL on efflux, each 8% increase in relative efflux is predicted to be associated with a 0.1 mmol/L increase in HDL cholesterol levels. For example, to effect a 30% increase in HDL cholesterol in a 40 year old male, it would require a 50% increase in ABC1 mediated cholesterol efflux. Although these numbers may not directly extrapolate to what is observed in a general population where other genetic and environmental factors have not been controlled for, these data nonetheless suggest that relatively small changes in ABC1 function may have a significant impact on plasma HDL cholesterol levels. Furthermore, the data presented here suggest that variations in efflux due to variations in ABC1 function directly reflect not only plasma HDL cholesterol levels but also triglyceride levels and CAD susceptibility, thus providing direct validation of the reverse cholesterol transport hypothesis and validation of ABC1 as a therapeutic target to raise HDL cholesterol, lower triglyceride levels, and protect against atherosclerosis.

[0132] The phenotype in ABC1 heterozygotes is also age-modulated. From 20 years of age in members of the control cohort, there is a small but definite increase in HDL with advancing age that is obviously absent in the heterozygotes. One explanation for this finding is that there is normally an age-related increase in ABC1 function, which is not seen in heterozygotes, perhaps because the remaining functioning allele has already been maximally up-regulated secondary to an increase in intracellular cholesterol. This lack of age-related increase in ABC1 function in heterozygotes would exaggerate the difference in HDL levels between heterozygotes and control individuals in older age groups. There is some evidence for an age-modulated increase in expression of ABC transporters (Gupta, Drugs Aging 7:19-29, 1995). Further, evidence of a potential age-related increase in ABC1 function comes from the observation that the percentage of apoAI found in the preβ₁ subfraction of HDL, the predominant cholesterol acceptors, decreases with age, suggesting increased formation of mature α-migrating HDL with age.

[0133] In a Regression Growth Evaluation Statin Study, carriers of the R219K ABC1 mutation were found to have significantly lower triglyceride levels that individuals without this mutation. This result suggests that compounds that bind near Arg219 in wild-type ABC1 or otherwise mimic the function provided by Lys219 in the R219K ABC1 variant may lower triglyceride levels, and thus decrease risk of CAD. In addition, carriers of the V399A ABC1 variant had higher HDL levels and fewer coronary events than individuals without this variant. Thus, compounds that bind near Val399 in wild-type ABC1 or mimic the function provided by Ala399 in the V399A ABC1 variant may increase cholesterol levels and decrease risk of CAD. Determining the presence or absence of the R219K or V399A ABC1 variants in individuals may be useful in selecting therapies (such as HDL-lowering, triglyceride-raising, or anti-CAD therapies) for these subjects.

[0134] The following examples are to illustrate the invention. They are not meant to limit the invention in any way.

[0135] Analysis of TD Families

[0136] Studies of Cholesterol Efflux

[0137] Both probands had evidence of marked deficiency of cholesterol efflux similar to that previously demonstrated in TD patients. TD-1 is of Dutch descent while TD-2 is of British descent.

[0138] Linkage Analysis and Establishment of a Physical Map

[0139] Multiple DNA markers were genotyped in the region of 9q31 to which linkage to TD had been described (Rust et al., Nat. Genet. 20, 96-98, 1998). Two point linkage analysis gave a maximal peak LOD score of 6.49 at D9S1832 with significant evidence of linkage to all markers in a ˜10 cM interval. Recombination with the most proximal marker, D9S1690 was seen in II-09 in Family TD-1, providing a centromeric boundary for the disease gene. Multipoint linkage analysis of these data did not increase the precision of the positioning of the disease trait locus.

[0140] A physical map spanning approximately 10 cM in this region was established with the development of a YAC contig. In addition, 22 other polymorphic multi-allelic markers which spanned this particular region were mapped to the contig, and a subset of these were used in construction of a haplotype for further analysis.

[0141] While the family of Dutch decent did not demonstrate any consanguinity, the proband in TD-2 was the offspring of a first-cousin consanguineous marriage. We postulated, therefore, that it was most likely that this proband would be homozygous for the mutation while the proband in the Dutch family was likely to be a compound heterozygote. The Dutch proband shows completely different mutation bearing haplotypes, supporting this hypothesis.

[0142] The TD-2 proband was homozygous for all markers tested distal to D9S127 but was heterozygous at D9S127 and DNA markers centromeric to it. This suggested that the gene for TD was likely located to the genomic region telomeric of D9S127 and encompassed by the markers demonstrating homozygosity.

[0143] Mutation Detection

[0144] Based on the defect in intracellular cholesterol transport in patients with TD, we reviewed the EST database for genes in this region which might be relevant to playing a role in this process. The ABC1 transporter gene had previously been mapped to 9q31, but its precise physical location had not been determined (Luciani et al., Genomics 21:150-159, 1994). The ABC1 gene is a member of the ATP binding cassette transporters which represents a super family of highly conserved proteins involved in membrane transport of diverse substrates including amino acids, peptides, vitamins and steroid hormones (Luciani et al., Genomics 21:150-159, 1994; Dean et al., Curr. Opin. Gen. Dev. 5:779-785, 1995). Primers to the 3′ UTR of this gene mapped to YACs spanning D9S306 (887-B2 and 930-D3) compatible with it being a strong candidate for TD. We initiated large scale genomic sequencing of BACs spanning approximately 800 kb around marker D9S306 (Research Genetics RPCI-11 BACs: 418, 31J20, 47019, and 179G21, which are publicly available from Research Genetics which is located at 2130 Memorial Parkway, Huntsville, Ala. 35801) BACs 4I8, 31J20, 47O19, and 179G21 are identical to previously described BACs 269, 274, 279 and 291, respectively (U.S. Ser. No. 09/526,193; U.S. Ser. No. 60/124,702; U.S. Ser. No. 60/138,048; U.S. Ser. No. 60/139,600; U.S. Ser. No. 60/151,977; Brooks-Wilson et al., Nat. Genet. 22:336-345, 1999). The ABC1 gene was revealed encompassing 49 exons and a minimum of 75 Kb of genomic sequence. In view of the potential function of a gene in this family as a cholesterol transporter, its expression in fibroblasts and localization to the minimal genomic segment underlying TD, we formally assessed ABC1 as a candidate.

[0145] Patient and control total fibroblast RNA was used in Northern blot analysis and RT-PCR and sequence analyses. RT-PCR and sequence analysis of TD-1 revealed a heterozygous T to C substitution in the TD-1 proband, which would result in a substitution of arginine for cysteine at a conserved residue between mouse and man. This mutation, confirmed by sequencing exon 31 of the ABC1 gene, exhibited complete segregation with the phenotype on one side of this family. This substitution creates a HgaI site, allowing for RFLP analysis of amplified genomic DNA and confirmation of the mutation. The point mutation in exon 31 was not seen on over 200 normal chromosomes from unaffected persons of Dutch decent, and 250 chromosomes of Western European decent, indicating it is unlikely to be a polymorphism. Northern blot analysis of fibroblast RNA from this patient, using a cDNA encompassing exons 2 to 50 of the gene, revealed a normal sized ˜8 Kb transcript and a truncated mutant transcript which was not visible in control RNA or in RNA from other patients with HDL deficiency. Additionally, Northern blot analysis using clones encompassing discrete regions of the cDNA revealed that the mutant transcript was detected with a cDNA compassing exons 2 to 50, 2 to 42, 2 to 23, much more faintly with a probe spanning exon 24 to 30, and not seen with probes encompassing exons 31 to 43 or a probe spanning exons 31 to 50. This was repeated on multiple filters with control RNA, RNA from other patients with HDL deficiency and the other TD proband, and only in TD-1 was the truncated transcript observed. Sequence analysis of the coding region did not reveal an alteration in sequence that could account for this finding. Furthermore, DNA analysis by Southern blot did not reveal any major rearrangements. Completion of exon sequencing in genomic DNA showed that this mutation was a G to C transversion at position (+1) of intron 24, affecting a splice donor site and causing aberrant splicing.

[0146] RT-PCR analysis of fibroblast RNA encoding the ABC1 gene from the proband in TD-2 revealed a homozygous nucleotide change of A to G at nucleotide 1864 in exon 14, resulting in a substitution of arginine for glutamine at residue 597, occurring just proximal to the first predicted transmembrane domain of ABC1 at a residue conserved in mouse and as well as a C. elegans homolog. This mutation creates a second AciI site within exon 14. Segregation analysis of the mutation in this family revealed complete concordance between the mutation and the low HDL phenotype as predicted. The proband in TD-2 is homozygous for this mutation, consistent with our expectation of a disease causing mutation in this consanguineous family.

[0147] Analysis of FHA Families

[0148] Linkage Analysis and Refinement of the Minimal Genomic Region Containing the Gene for FHA

[0149] Data from microsatellite typing of individual family members from the four pedigrees of French Canadian origin were analyzed. A maximum LOD score of 9.67 at a recombination fraction of 0.0 was detected at D9S277 on chromosome 9q31. Thereafter, 22 markers were typed in a region spanning 10 cM around this locus in these families. The frequency for these markers were estimated from a sample of unrelated and unaffected subjects of French ancestry.

[0150] TD and FHA have thus far been deemed distinct with separate clinical and biochemical characteristics. Even though the genes for these disorders mapped to the same region, it was uncertain whether FHA and TD were due to mutations in the same gene or, alternatively, due to mutations in genes in a similar region. Refinement of the region containing the gene for FHA was possible by examining haplotype sharing and identification of critical recombination events. Seven separate meiotic recombination events were seen in these families, clearly indicating that the minimal genomic region containing the potential disease gene was a region of approximately 4.4 cM genomic DNA spanned by marker D9S1690 and D9S1866. This region is consistent with the results of two point linkage analysis which revealed maximal LOD scores with markers D9S277 and D9S306 and essentially excluded the region centromeric to D9S1690 or telomeric to D9S1866. An 8^(th) meiotic recombination event further refined the FHA region to distal to D9S277.

[0151] As described herein, the ABC1 gene mapped within this interval. The overlapping genetic data strongly suggested that FHA may in fact be allelic to TD. Utilization of sets of genetic data from FHA and TD provided a telomeric boundary at D9S1866 (meiotic recombinant) and a centromeric marker at D9S127 based on the homozygosity data of TD-2. This refined the locus to approximately 1 mb between D9S127 and D9S1866. The ABC1 gene mapped within this minimal region.

[0152] Mutation Detection in FHA

[0153] Mutation assessment of the ABC1 gene was undertaken in FHA-1. Using primers that spanned overlapping segments of the mRNA we performed RT-PCR analysis and subjected these fragments to mutational analysis. A deletion of three nucleotides is evident in the RT-PCR sequence of FHA-1 III.01, resulting in a loss of nucleotides 2151-2153 and deletion of a leucine (L693) at amino acid position 693. This leucine is conserved in mouse and C. elegans. The alteration was detected in the RT-PCR products as well as in genomic sequence from exon 15 specific amplification. This mutation results in a loss of an EarI restriction site. Analysis of genomic DNA from the family indicated that the mutation segregated completely with the phenotype of HDL deficiency. The loss of the EarI site results in a larger fragment being remaining in persons heterozygous for this mutation. This mutation maps to the first putative transmembrane domain of ABC1 and was not seen in 130 chromosomes from persons of French Canadian descent nor seen in over 400 chromosomes from persons of other Western European ancestry.

[0154] A mutation has also been found in patient genomic DNA in pedigree FHA-3 from Quebec. The alteration, a 6 bp deletion of nucleotides 5752-5757 within exon 42, results in a deletion of amino acids 1893 (Glu) and 1894 (Asp). The deletion was detected as a double superimposed sequence starting from the point of the deletion, and was detected in sequence reads in both directions. The deletion can be detected on 3% agarose or 10% polyacrylamide gels, and segregates with disease in FHA-3. It was not seen in 128 normal chromosomes of French-Canadian origin or in 434 other control chromosomes. Amino acids 1893 and 1894 are in a region of the ABC1 protein that is conserved between human, mouse, and C. elegans, implying that it is of functional importance.

[0155] An additional mutation has been found in patient genomic DNA in pedigree FHA-2 from Quebec. The alteration, a C to T transition at position 6504, converts an arginine at position 2144 to a STOP codon, causing truncation of the last 118 amino acids of the ABC1 protein. This alteration segregates with disease in family FHA-2.

[0156] Functional Relationship Between Changes in ABC1 Transcript Levels and Cholesterol Efflux

[0157] Antisense approaches were undertaken to decrease the ABC1 transcript and assess the effect of alteration of the transcript on intracellular cholesterol transport. The use of antisense primers to the 5′ end of ABC1 clearly resulted in a decrease to approximately 50% of normal RNA levels. This would be expected to mimic in part the loss of function due to mutations on one allele, similar to that seen in heterozygotes for TD and patients with FHA. Importantly, reduction in the mRNA for the ABC1 gene resulted in a significant reduction in cellular cholesterol efflux, further establishing the role of this protein in reverse cholesterol transport and providing evidence that the mutations detected are likely to constitute loss of function mutations. Furthermore, these data support the functional importance of the first 60 amino acids of the protein. Antisense oligonucleotide AN-6 is directed to the novel start codon 5′ to the one indicated in AJ012376.1; this antisense oligonucleotide effectively suppresses efflux.

[0158] Polymorphisms in ABC1 5′ Regulatory Region and 5′ UTR

[0159] Several polymorphisms in the 5′ regulatory region of human ABC1 (SEQ ID NO: 1) have been identified (FIG. 4). Because of their location, it is likely that ABC1 gene expression will differ among humans having different promoter polymorphisms, and these individuals may also respond differently to the same drug treatment. Thus, using these newly-identified polymorphisms, one can tailor drug treatment depending on which polymorphism(s) is/are present in a patient. The presence or absence of particular ABC1 polymorphisms may also be used in determining an individual's predisposition to developing CVD.

[0160] The methods of the invention may be performed using the following materials and methods.

[0161] Biochemical Studies

[0162] Blood is withdrawn in EDTA-containing tubes for plasma lipid, lipoprotein cholesterol, ApoAI, and triglyceride analyses, as well as storage at −80° C. Leukocytes arere isolated from the buffy coat for DNA extraction.

[0163] Lipoprotein measurement is performed on fresh plasma as described elsewhere (Rogler et al., Arterioscler. Thromb. Vasc. Biol. 15:683-690, 1995). Lipids, cholesterol and triglyceride levels are determined in total plasma and plasma at density d<1.006 g/mL (obtained after preparative ultracentrifugation) before and after precipitation with dextran manganese. Apolipoprotein measurement is performed by nephelometry for ApoB and ApoAI.

[0164] Genomic Clone Assembly and Physical Map Construction of the 9q31 Region

[0165] Using the Whitehead Institute/MIT Center for Genome Research map as a reference, the genetic markers of interest at 9q31 were identified within YAC contigs. Additional markers that mapped to the approximate 9q31 interval from public databases and the literature were then assayed against the YAC clones by PCR and hybridization analysis. The order of markers was based on their presence or absence in the anchored YAC contigs and later in the BAC contig. Based on the haplotype analysis, the region between D9S277 and D9S306 was targeted for higher resolution physical mapping studies using bacterial artificial chromosomes (BACs). BACs within the region of interest were isolated by hybridization of DNA marker probes and whole YACs to high-density filters containing clones from the RPCI-11 human BAC library.

[0166] Sequence Retrieval and Alignment

[0167] The human ABC1 mRNA sequence was retrieved from GenBank using the Entrez nucleotide query (Baxevanis et al., A Practical Guide to the Analysis of Genes and Proteins, eds. Baxevanis, A. D. & Ouellette, B. F. F. 98:120, 1998) as GenBank accession number AJ012376.1. The version of the protein sequence we used as wild-type (normal) was CAA10005.1.

[0168] We identified an additional 60 amino acids in-frame with the previously-believed start methionine. Bioinformatic analysis of the additional amino acids indicates the presence of a short stretch of basic amino acid residues, followed by a hydrophobic stretch, then several polar residues. This may represent a leader sequence, or another transmembrane or membrane-associated region of the ABC1 protein. In order to differentiate among the foregoing possibilities, antibodies directed to the region of amino acids 1-60 are raised against and used to determine the physical relationship of amino acids 1-60 in relation to the cell membrane. Other standard methods can also be employed, including, for example, expression of fusion proteins and cell fractionation.

[0169] The mouse ABC1 sequence used has accession number X75926. It is very likely that this mouse sequence is incomplete, as it lacks the additional 60 amino acids described herein for human ABC1.

[0170] Version 1.7 of ClustalW was used for multiple sequence alignments with BOXSHADE for graphical enhancement (www.isrec.isb-sib.ch:8080/software/BOX_(—)form.html) with the default parameter. A Caenorhabditis elegans ABC1 orthologue was identified with BLAST (version 2.08) using CM1005.1 (see above) as a query, with the default parameter except for doing an organism filter for C. elegans. The selected protein sequence has accession version number MC69223.1 with a score of 375, and an E value of 103.

[0171] Genomic DNA Sequencing

[0172] BAC DNA was extracted from bacterial cultures using NucleoBond Plasmid Maxi Kits (Clontech, Palo Alto, Calif.). For DNA sequencing, a sublibrary was first constructed from each of the BAC DNAs (Rowen et al., Automated DNA Sequencing and Analysis, eds. Adams, M. D., Fields, C. & Venter, J. C., 1994). In brief, the BAC DNA was isolated and randomly sheared by nebulization. The sheared DNA was then size fractionated by agarose gel electrophoresis and fragments above 2 kb were collected, treated with Mung Bean nuclease followed by T4 DNA polymerase and klenow enzyme to ensure blunt-ends, and cloned into SmaI-cut M13 mp19. Random clones were sequenced with an ABI373 or 377 sequencer and fluorescently labeled primers (Applied BioSystems, Foster City, Calif.). DNAStar software was used for gel trace analysis and contig assembly. All DNA sequences were examined against available public databases primarily using BLASTn with RepeatMasker (University of Washington). The sequence of each of the assembled contigs is shown in FIGS. 1A-D.

[0173] Reverse Transcription (RT)-PCR Amplification and Sequence Analysis

[0174] Total RNA was isolated from the cultured fibroblasts of TD and FHA patients, and reverse transcribed with a CDS primer containing oligo d(T)18 using 250 units of SuperScript II reverse transcriptase (Life Technologies, Inc., Rockville, Md.) as described (Zhang et al., J. Biol. Chem. 27:1776-1783, 1996).

[0175] cDNA was amplified with Taq DNA polymerase using primers derived from the published human ABC1 cDNA sequence (Luciani et al., Genomics 21:150-159,1994). Six sets of primer pairs were designed to amplify each cDNA sample, generating six DNA fragments which are sequentially overlapped covering 135 to 7014 bp of the full-length human ABC1 cDNA. The nucleotides are numbered according to the order of the published human cDNA sequence (AJ012376.1). Primer pairs (1): 135-158 (f) and 1183-1199 (r); (2): 1080-1107 (f) and 2247-2273 (r); (3): 2171-2197 (f) and 3376-3404 (r); (4): 3323-3353 (f) and 4587-4617 (r); (5) 4515-4539 (f) and 5782-5811 (r); (6): 5742-5769 (f) and 6985-7014 (r). RT-PCR products were purified by Qiagen spin columns. Sequencing was carried out in a Model 373A Automated DNA sequencer (Applied Biosystems) using Taq di-deoxy terminator cycle sequencing and Big Dye Kits according to the manufacturer's protocol.

[0176] Northern Blot Analysis

[0177] Northern transfer and hybridizations were performed essentially as described (Zhang et al., J. Biol. Chem. 27:1776-1783, 1996). Briefly, 20 □g of total fibroblast RNA samples were resolved by electrophoresis in a denaturing agarose (1.2%; w/v) gel in the presence of 7% formaldehyde, and transferred to nylon membranes. The filters were probed with ³²P-labeled human ABC1 cDNA as indicated. Pre-hybridization and hybridizations were carried out in an ExpressHyb solution (ClonTech) at 68° C. according to the manufacturer's protocol.

[0178] Cell Culture

[0179] Skin fibroblast cultures are established from 3.0 mm punch biopsies of the forearm of FHD patients and healthy control subjects as described (Marcil et al., Arterioscler. Thromb. Vasc. Biol. 19:159-169, 1999).

[0180] Cellular Cholesterol Labeling and Loading

[0181] The protocol for cellular cholesterol efflux experiments has been described in detail elsewhere (Marcil et al., Arterioscler. Thromb. Vasc. Biol. 19:159-169, 1999). The cells are ³H-cholesterol labeled during growth and free cholesterol loaded in growth arrest.

[0182] Cholesterol Efflux Studies

[0183] Efflux studies are carried out from 0 to 24 hours in the presence of purified ApoAI (10 μg protein/mL medium). Efflux is determined as a percent of free cholesterol in the medium after the cells were incubated for specified periods of time. All experiments are preferably performed in triplicate, in the presence of cells from one control subject and the cells from the study subjects to be examined.

[0184] Determination of Genomic Structure of the ABC1 Gene

[0185] Most splice junction sequences were determined from genomic sequence generated from BAC clones spanning the ABC1 gene. More than 160 kb of genomic sequence were generated. Genomic sequences were aligned with cDNA sequences to identify intron/exon boundaries. In some cases, long distance PCR between adjacent exons was used to amplify intron/exon boundary sequences using amplification primers designed according to the cDNA sequence. The genomic sequence of human ABC1 is shown in FIGS. 1A-D.

[0186] Analysis of ABC1 Heterozygotes

[0187] Identification of Subjects

[0188] Subjects heterozygous for mutations in the ABC1 gene were individuals identified from the seven TD and FHA families previously described (Brooks-Wilson et al., supra; Marcil et al., supra). In addition, heterozygous individuals from three new Tangier disease families (TD3-5) and one new FHA kindred (FHA6) were included. The second mutation has not been identified in one of the TD kindreds (TD4); however, a marker immediately adjacent to ABC1 cosegregates with the low HDL phenotype. Individuals bearing the affected haplotype were considered heterozygotes. The presence or absence of mutations identified by genomic sequencing of probands from each family was subsequently confirmed by restriction fragment length polymorphism (RFLP) assays, to define heterozygous and unaffected individuals, respectively.

[0189] The control cohort consisted of unaffected members of the 11 families. These individuals share a genetic background with the heterozygotes, and environmental factors are expected to be similar amongst family members. Thus, many additional factors that influence HDL are controlled for, and the phenotypic differences between heterozygotes and unaffected individuals can be largely attributed to variation in ABC1 gene activity.

[0190] All subjects gave informed consent to their participation in this study, and the genetic analysis protocol was approved by the Ethics committees of the University of British Columbia, the Academic Medical Centre in Amsterdam and the Clinical Research Institute of Montreal (IRCM).

[0191] Lipid and Cholesterol Efflux Measurements

[0192] Lipid levels in ABCA1 heterozygotes were measured as previously described (Brooks-Wilson et al., supra; Marcil et al., supra), at standardized lipid clinics in Vancouver, Montreal and Amsterdam. LDL was calculated by the method of Friedewald et al. (Clin. Chem. 18:499-502, 1972), modified to account for lipid measurements in mmol/L.

[0193] Cellular cholesterol efflux from fibroblast cultures was measured as previously described (Brooks-Wilson et al., supra; Marcil et al., supra). Each experiment was performed in triplicate wells and averaged. Measurements are reported as the percentage efflux in each subject relative to an average of at least two healthy controls included within the same experiment. Individual experiments were repeated at least twice, and the average relative efflux over all experiments was used.

[0194] Statistics

[0195] In analysis of the heterozygotes, differences in mean baseline demographics and lipid levels between groups were compared by Student's t-test. Comparisons of frequency either between the male to female ratio or of distributions across various percentile ranges were made using the chi-square test. Analyses of potential interactions between affected status and either sex or BMI were performed using a general linear model. Statistical analysis was performed using Prism (version 3.00, Graphpad Software) or Systat (version 8.0, SPSS Inc.). All values are reported as mean A standard deviation.

[0196] Decreased HDL Cholesterol and an Increased Risk for CAD in ABC1 Heterozygotes

[0197] The analyzed cohort comprised 77 individuals from 11 families identified as heterozygous for mutations in the ABC1 gene. A comparison of mean lipid levels in heterozygotes with mean levels in all available unaffected family members (n=156) is presented in FIG. 6. Heterozygotes have an approximately 40-45% decrease in HDL and apoA-1 and a mild (approximately 10%) decrease in apoA-II compared to unaffected family members. Mean triglycerides (TG) were increased by approximately 40% in heterozygotes compared to unaffected family members, and were further increased in TD patients. Unlike TD patients, there is no significant decrease in either total cholesterol (TC) or LDL cholesterol in heterozygotes, and apoB levels were not different in heterozygotes from controls. Mean HDL levels in carriers of each of the mutations were similarly reduced by approximately 40-50% compared to unaffected family members (FIG. 9).

[0198] The heterozygote phenotype was further examined by calculating the percentage of individuals falling within a given range of age and sex specific percentiles (based on LRC criteria (Heiss et al., supra; Heiss et al., Circulation 61:302-315, 1980). Much variability in the heterozygote phenotype was evident. As illustrated in FIG. 5A, although a significantly higher percentage of heterozygotes had HDL cholesterol less than the 5^(th) percentile for age and sex compared to unaffected controls (65% vs. 5%, p<0.0001), 5% of the heterozygotes had HDL greater than the 20^(th) percentile, with HDL ranging up to the 31^(st) percentile for age and sex. Thus, in some individuals clearly the low HDL phenotype is less severe. A broad distribution of triglyceride (TG) levels was also evident (FIG. 5B). A significantly lower percentage of heterozygous individuals had TG below the 20^(th) percentile for age and sex (p=0.03), and a significantly larger percentage had TG>80^(th) percentile (p=0.005) compared to unaffected family members, but substantial overlap between the two distributions was seen.

[0199] Another important question is whether individuals heterozygous for ABC1 mutations are at an increased risk of developing coronary artery disease (CAD). In our large cohort, symptomatic vascular disease was over three times as frequent in the adult heterozygotes as in unaffected family members (FIG. 6). The forms of vascular disease were generally more severe in the heterozygotes than in their unaffected family members (FIG. 7). Heterozygotes had myocardial infarctions (five, one fatal) and severe vascular disease requiring multiple interventions, whereas in unaffected individuals, CAD was manifest as angina in two cases and as a transient ischemic attack at the age of 80 in another. Furthermore, the mean age-of-onset was on average a decade younger in heterozygotes compared to unaffected controls (FIG. 6)

[0200] Cholesterol Efflux, HDL Level, and CAD in ABC1 Heterozygotes

[0201] We have previously shown that individuals heterozygous for ABC1 mutations have decreased cholesterol efflux. In the present study, the relationship between cholesterol efflux levels, HDL cholesterol levels, and CAD was further assessed. Relative cholesterol efflux in individuals heterozygous for an ABC1 mutation was plotted against the mean HDL cholesterol levels observed in the carriers of that mutation, expressed as a percentage of the unaffected members within that family (FIG. 8). Cholesterol efflux levels associated with each mutation strongly predict the corresponding HDL cholesterol levels in the families, accounting for 82% of the variation in HDL cholesterol (r²=0.82, p=0.005). Furthermore, in one large family (FHA2), where efflux has been measured in three independent heterozygotes, an r² value of 0.81 was obtained when individual plasma HDL levels were plotted against individual efflux measurements. Using the regression equation of mean HDL levels in the heterozygotes on the efflux level of the heterozygous carrier (p=0.02), the relationship between expected changes in ABC1 efflux activity and HDL levels was estimated. Based on this analysis, we predict that each 8% change in efflux levels would be associated with a 0.1 mmol/L change in HDL cholesterol.

[0202] Relative cholesterol efflux levels are also related to CAD within the family. Families with clearest evidence for premature CAD had individuals with the lowest cholesterol efflux (bold on FIG. 8 and FIG. 9). These data suggest that the level of residual ABC1 function is a critical determinant of both HDL cholesterol levels and susceptibility to CAD.

[0203] Comparison of Mutation Type and Location to the Severity of Phenotype in Individuals Heterozygous for ABC1 Mutations

[0204] We have previously noted that the phenotypic presentation of our FHA heterozygotes was more severe than that of our TD heterozygotes. Furthermore, we initially noted more deletions and premature truncations of the protein in our FHA families than our TD families. Thus, as residual ABC1 activity is an important predictor of severity of the phenotype, the influence of the nature of the mutation on the phenotypic expression of mutations in the ABC1 gene was examined. Severe mutations which would be expected to result in a non-functional allele were defined as deletions or mutations that caused premature truncation of the protein (frameshifts and nonsense mutations) or that disrupted natural splicing of the protein. Missense mutations, on the other hand, result in the change of only a single amino acid and may result in a protein product that still retains partial activity.

[0205] Lipid levels were compared in heterozygous carriers of severe and missense mutations. While there was a trend to decreased HDL levels in carriers of severe compared to missense mutations, this trend did not reach significance (FIG. 10). A range of HDL levels in individual missense and severe mutations were observed (FIG. 9). The M1091T missense mutation is the most severe mutation in terms of effects on efflux and HDL levels, with a more severe phenotype than even early truncations of the protein (e.g. R909X). The site of mutation (e.g. N-terminal or C-terminal) within the ABC1 protein did not influence the phenotype (FIG. 11). The presence of CAD is seen in carriers of mutations in several domains of the protein. Patients with mutations on both alleles manifest with splenomegaly alone or in association with CAD (TD1). Thus the phenotype appears to be mutation specific, and most likely dependent on remaining ABC1 function of the wild-type allele and residual function of the mutant allele, similar to what has been shown for mutations in ABCR, a close homologue of ABC1 (van Driel et al., Ophthalmic Genet. 19:117-122, 1998).

[0206] Relationship Between Phenotype of Mutations in the ABC1 Gene and Age

[0207] One factor influencing phenotypic expression that became apparent in the families was age. We first characterized the effect of age on the HDL levels of individuals in two previously reported families (Marcil et al., supra) (FIGS. 12A and 12B). In family FHA3, while heterozygous individuals in generations II and III all had HDL cholesterol levels less than the 5^(th) percentile for age and sex, those in generation IV had a much more variable phenotype, with HDL cholesterol ranging up to the 20^(th) percentile. In family FHA1, the same pattern was observed.

[0208] The distribution of individuals less than 30 years old across HDL percentile ranges was compared to the corresponding distribution of individuals 30-70 years old across HDL percentile ranges (FIG. 13). A significantly larger percentage of individuals 30-70 years old had HDL cholesterol less than the 5^(th) percentile compared to the percentage of individuals less than 30 years old with HDL cholesterol less than the 5^(th) percentile. Mean HDL decreases in heterozygotes greater than 30 years of age compared to those less than 30 years of age; in contrast, there is no significant change in unaffected controls (FIG. 14). Similar results are seen in males and females separately and in both pre- and post-menopausal ages in women (FIGS. 15A and 15B). Triglyceride levels increase with age in both heterozygotes and unaffected family members.

[0209] Assessment of the Influences of Gender and BMI on the Phenotypic Expression of ABC1 Mutations

[0210] Females are known to have elevated HDL and decreased triglyceride levels compared to males. Thus, the phenotype of ABC1 heterozygotes was analyzed to determine whether the phenotype was influenced by gender. HDL cholesterol is significantly lower than unaffected controls in both heterozygous males and females (0.70 Å 0.24 versus 1.21+0.29, p<0.0001; 0.76 Å 0.25 versus 1.41 Å 0.38, p<0.0001, respectively). This was reflected in decreased apoAI (0.92 Å 0.27 versus 1.36 Å 0.22, p<0.0001; 0.92 Å 0.36 versus 1.49 Å 0.28, p<0.0001 in males and females, respectively), and a trend towards a mild decrease in apoAII in both males and females compared to unaffected family members (0.35 Å 0.08 versus 0.40 Å 0.09, p=0.08; 0.35 Å 0.09 versus 0.39 Å 0.07, p=0.06, respectively). Triglycerides are higher in both male (2.07 Å 2.16 versus 1.30 Å 1.30, p=0.02) and female (1.34 Å 0.86 versus 1.09 Å 0.63, p=0.08) heterozygotes compared to unaffected family members. The difference in HDL between males and females was reduced in heterozygotes compared to controls (p=0.11), while the difference in triglycerides was increased compared to controls (p=0.13).

[0211] Another factor known to influence HDL and triglyceride levels is BMI. The entire cohort was divided into tertiles of BMI, and the mean HDL and triglyceride levels of heterozygotes and unaffected individuals by BMI tertile are shown in FIGS. 16A and 16B. BMI had a significant effect on both HDL and triglycerides in both heterozygotes and controls (p<0.0001). The effect of BMI on HDL-C and triglyceride levels was more severe in heterozygotes for ABC1 than in controls, being evident at lower BMIs (mid-tertile) in heterozygotes. A raised BMI was more obviously associated with changes in HDL and triglyceride levels in heterozygotes compared to controls. However, neither effect reached statistical significance. HDL was reduced in heterozygotes compared to controls in all BMI tertiles (p<0.0001 in each tertile). While triglyceride levels were increased in all BMI tertiles in heterozygotes compared to unaffected family members, this difference was only significant in the middle BMI tertile (p=0.009).

[0212] Analysis of ABC1 SNPs from REGRESS Study

[0213] Identification of SNPs

[0214] SNPs in the ABC1 gene were identified during the complete genomic sequencing of 14 unrelated probands with low HDL-C (Brooks-Wilson et al., supra 1999; Marcil et al., supra 1999). Variants that were identified within the low HDL families that did not co-segregate with the low HDL phenotype or that were observed in unaffected individuals were assumed to be SNPs. Based on the sequencing of BAC clones spanning the entire ABC1 region (described above), sites identified as heterozygous or different from that found in sequenced individuals were also identified as polymporphisms. Sequence data was available from at least one control individual at all variant coding sites. The SNPs are numbered from the nucleotide described as position 1 (Pullinger et al., Biochemical and Biophysical Research Communications 271:451-455, 2000), naming the first exon number 1. As a standardized nomenclature for all variants, the “wild-type” allele (more frequent in the REGRESS population) is designated A, while the variant (less frequent) allele is designated B.

[0215] Subjects

[0216] To assess the effects of these SNPs on lipid levels and CAD, we studied a cohort of 804 men with proven coronary artery disease who participated in the Regression Growth Evaluation Statin Study (REGRESS), which has previously been described in detail (Jukema et al., Circulation 91:2528-2540, 1995). Briefly, study participants were required to have at least one coronary artery with a stenosis of more than 50% as assessed by coronary angiography, a plasma total cholesterol concentration of 4 to 8 mmol/L (155 to 310 mg/dL), and a plasma triglyceride concentration below 4 mmol/L (350 mg/dL). Phenotypic effects of the cSNPs were examined in relationship to baseline lipid parameters.

[0217] Patients were randomly assigned to treatment with pravastatin (Pravachol, Bristol-Myers Squibb, Princeton, N.J.) or placebo for a period of two years. Computer-assisted quantitative coronary angiography was carried out at the start and at the end of the study as previously described (Jukema et al., supra (1995)). The baseline values and changes in the average mean segment diameter (MSD), which is a measurement of the average unobstructed diameter along the vessel, and in the minimal obstructive diameter (MOD), which is a measurement of the smallest unobstructed segment, were used as the primary measures of CAD. The MSD reflects diffuse changes of atherosclerosis, and the MOD reflects focal atherosclerotic changes. Larger MSD and MOD measurements reflect less occlusion of the vessel, and a decrease in these parameters reflects progression of coronary atherosclerosis. In addition, the prevalence of coronary events; defined as death, myocardial infarction, unscheduled coronary angioplasty or bypass surgery (PTCA, CABG), or stroke/transient ischemic attack; was examined.

[0218] Blood was collected from each patient at baseline, and DNA was extracted according to standard procedures. Several subsequent genetic studies have been performed on this cohort (Reymer et al., Nature Genetics 10:28-34, 1995; Jukema et al., Circulation 94:1913-1918, 1996; Kluijtmans et al., Circulation 96:2573-2577, 1997; Kastelein et al., Clinical Genetics 53:27-33, 1998; Kuivenhoven et al., New England Journal of Medicine 338:86-93, 1998). The REGRESS and its DNA substudies were approved by all seven institutional review boards of the participating centers and by their medical ethics committees.

[0219] Additional Dutch subjects with low HDL and premature coronary artery disease were obtained from previously described populations (Kuivenhoven et al., Arteriosclerosis, Thrombosis and Vascular Biology 17:560-568, 1997; Verhoff et al., Atherosclerosis 141:161-166 1998; Franco et al., British Journal of Hematology 102:1172-1175, 1998; Wittekoek et al., Artherosclerosis 146:271-279, 1999; Franko et al., British Journal of Hematology 104:50-54, 1999). Dutch control subjects were taken from a large population based study designed to assess the effects of various risk factors on CAD (Seidell et al., International Journals of Obesity 19:924-927 1995; Kuivenhoven et al. Arteriosclerosis, Thrombosis and Vascular Biology 17:595-599,1997). French Canadian subjects were a random sample of individuals. The South African Black and Cantonese cohorts have previously been described (Ehrenborg et al., Arteriosclerosis, Thrombosis and Vascular Biology 17:2672-2678, 1997). All subjects gave informed consent.

[0220] cSNP Screening

[0221] For each variant, a restriction enzyme whose cleavage pattern was altered by the variant was identified for development of an RFLP assay. If no suitable enzyme was found, a mismatch strategy was employed, whereby a single nucleotide mismatch was incorporated into the PCR primer, creating a restriction site in combination with either the wild-type or variant allele. The specific conditions of all assays are described in FIG. 17. All PCR reactions were carried out in 50 μL volumes, in the presence of 1×PCR buffer and 1.5 μM MgCl₂ (Life Technologies). Thermocycling parameters for all assays were as follows: 95° C. for 3 min; 35 cycles of denaturation at 95° C. for 10 seconds, annealing for 30 seconds at the temperature specified in FIG. 17, and elongation for 30 seconds at 72° C.; and a final elongation at 72° C. for 10 min. All digestions (15-20 μL PCR product) were carried out in manufacturer's buffer (New England Biolabs) for 2 hours at the temperature specified by the manufacturer. As an example, the digest results for the R219K are shown in FIG. 18. A 177 base pair fragment with the A allele is not cut by EcoNI, whereas the B allele is digested to produce fragments of 107 and 70 base pairs. Heterozygous individuals thus display all three bands (177, 107, and 70 base pairs).

[0222] Genotyping with the TaqMan(r) Assay

[0223] To facilitate the mass screening of some variants, TaqMan based polymerase chain reaction (PCR) assays (Holland et al., Proc. Natl. Acad. Sci. 88 (16):7276-7280, 1991; Lee et al., Nucleic Acids Research 21 (16):3761-3766, 1993) were developed for the detection of polymporphisms in the ABC1 gene. In this one-tube assay, two fluorogenic hybridization probes (one for each allele) are labeled with different fluorescent reporter dyes (FAM or TET) at their 5′ terminus and a common quencher dye (TAMRA) at their 3′ terminus. These probes are cleaved by the 5′ nuclease activity of Taq enzyme during PCR amplification. This cleavage separates the reporter from the quencher dye and generates an increase in reporter fluorescence. By using two different reporter dyes, cleavage of allele-specific probes can be detected in a single PCR. The difference in the measured fluorescence intensity between the two TaqMan probes allows for accurate allele calling.

[0224] PCR amplifications with flanking sets of primers (300 nM) in the presence of two TaqMan probes (25 nM each) and 4.5 mM MgCl₂ were performed using the following thermocycling protocol: initial denaturation at 96° C. for 10 min, followed by 39 cycles of 96° C. for 30 sec, 63° C. for 1 min and 72° C. for 15 sec, followed by a final extension at 72° C. for 10 min. Each plate included controls (no DNA template) as well as standards of each known genotype. Fluorescence quantification and genotype determination were performed on a Perkin Elmer LS50B or ABI Prism 7700 Sequence Detector. The fluorescence from each reaction was normalized to the signal from the no-template controls.

[0225] Cellular Cholesterol Efflux Studies

[0226] Cellular cholesterol efflux from fibroblast cultures was measured as described above in the “Analysis of ABC1 Heterozygotes” section.

[0227] Statistics

[0228] Within the REGRESS population, the baseline characteristics of the patients in the three genotypes (M, AB, BB) was compared using one way analysis of variance and the chi-square test, where appropriate. In cases where the BB genotype was rare, the M group was compared to the combined group AB+BB.

[0229] The cumulative coronary event incidence in these genotypic groups was compared using the logrank test. The relation between age and HDL level was investigated using a linear regression model, and the difference between genotypes with respect to the slopes of this regression line was tested using covariance analysis. In addition to this regression analysis, the R219K genotypes were compared among age-defined subgroups. Randomization to placebo and pravastatin was assessed by chi square analysis and was equivalent in all genotypic groups for all variants except the R1587K variant. For this variant, a lower proportion of carriers was randomized to pravastatin treatment. In addition, change in MOD, MSD, and prevalence of coronary events (the three variables measured during the trial and thus following randomization) were analyzed for the placebo and pravastatin subgroups separately. Similar genotypic effects for each of the variants were observed in the treatment subgroups (i.e. the pravastatin and placebo control groups).

[0230] All lipid levels are reported in mmol/L. All values are reported as mean A standard deviation.

[0231] Association of R219K Polymorphism with Reduced Triglyceride Levels and a Decreased Risk of CAD

[0232] The common R219K polymorphism results in the substitution of a lysine for an arginine at amino acid 219 of the ABC1 protein. The allele frequency of the variant, or “B”, allele was 0.25, and its carrier frequency was 46.3%, as shown in FIG. 19.

[0233] The relationship between this polymorphism and CAD was examined. The B allele of the R219K polymorphism was associated with decreased baseline CAD. Both the MSD and the MOD, were significantly increased in an allele-dose dependent fashion from M to AB to BB (FIG. 20).

[0234] The angiographic data was paralleled by differences in clinical coronary events rates. A smaller percentage of individuals homozygous for the rarer B allele (BB) had had a myocardial infarction (MI) prior to the initiation of the trial. Carriers of the B allele exhibited a strong trend towards an increased prevalence of no events over the course of the study (FIG. 21, p=0.07).

[0235] The association of the R219K polymorphism with reduced CAD was further supported by the reduced frequency of this allele in the REGRESS cohort which was selected for CAD. In fact, the genotype frequencies observed for this variant are not consistent with Hardy-Weinberg equilibrium (p=0.004). There are fewer BB individuals and more AB individuals than would be expected (424 AA, 330 AB, and 36 BB individuals were observed compared to the expected 439 AA, 300 AB, and 51 BB individuals). As the REGRESS cohort was selected for men with CAD, the lack of Hardy-Weinberg equilibrium suggests that there was a preferential selection against BB individuals, consistent with the reduced CAD observed in this group.

[0236] There were no obvious differences in mean HDL-C levels between the genotypes in the groups as a whole (FIG. 22); however, triglycerides were significantly lower in the carriers of the B allele. This suggests that ABC1 function may also directly influence plasma triglyceride levels, and that this variant may be associated with a gain of ABC1 function. These findings are also consistent with the decreased CAD observed in carriers, suggesting that ABC1 modulation of triglyceride levels may be another mechanism whereby ABC1 activity influences risk of CAD.

[0237] To further explore the apparent lack of effect of this variant on HDL-C, the relationship between R219K genotype and HDL at various ages was examined. In younger individuals, the carriers of the B allele had increased HDL-C compared to non-carriers (FIG. 23) Furthermore, while HDL-C increased with age in the AA individuals, heterozygous carriers showed a much milder increase, and HDL cholesterol decreased with age in homozygous carriers (FIG. 24). In the AA individuals, HDL cholesterol is positively correlated with age (p<0.001). In contrast, this relation was not apparent in individuals heterozygous for this variant, and HDL cholesterol was negatively correlated with age in the BB homozygotes, although neither the correlation for AB nor the correlation for BB individuals was statistically different from zero (FIG. 25A). The age-related increase in HDL-C in AA individuals is not maintained in carriers of the B allele (p value comparing slopes=0.04). Thus, the decreased CAD in carriers of the R219K may also be related to the fact that for the majority of their lifetime, carriers have had increased HDL-C compared to non-carriers.

[0238] The changes in HDL-C with age in the various R219K genotypes are mirrored by similar trends in the changes in cholesterol efflux with age (FIG. 25B) We have genotyped this variant in all individuals in whom we have measured efflux and who do not possess an ABC1 mutation. In AA individuals (n=30), cholesterol efflux increases with age, whereas in AB and BB individuals (n=24), efflux decreases with age (p-value comparing slopes=0.15). Thus, the differential age-related changes in HDL seen in the different R219K genotypes are consistent with similar functional changes in ABC1 activity.

[0239] From FIG. 23 it can be seen that triglyceride levels generally decrease with age, a finding seen in all R219K genotypes. The percent decrease in triglyceride levels was nearly half in carriers (9.3%) compared to non-carriers (17.3%, p=0.07). Differences between the genotypes are also observed in the MSD and MOD. In the non-carriers, MSD and MOD measurements decrease significantly with age, reflecting increased atherosclerosis in the older individuals (FIGS. 23 and 26). In contrast, in carriers of the R219K variant, these measurements do not significantly change with age. Thus, vascular disease progresses much more slowly with age in carriers of the R219K variant compared to non-carriers.

[0240] Populations of Asian and African origins have been shown to have increased HDL-C, decreased triglyceride levels, and decreased risk of CAD compared to Caucasian populations (Tyroler et al., Circulation 62 (Suppl. IV):IV-99-IV-107, 1980; Tao et al., International Journal of Epidemiology 21 (5):893-903, 1992; Brown et al., Arteriosclerosis and Thrombosis 13:1139-1158. 1993; Adedeji, Tropical and Geographical Medicine 46 (1):23-26, 1994; Simon et al., American Journal of Public Health 85 (12):1698-1702, 1995; Morrison et al., Metabolism 47 (5):514-521, 1998), a finding paralleling the phenotypic effects of this variant. Thus, as SNP frequencies can often differ within different ethnic groups, the frequency of this variant within these two population groups was examined (FIG. 27). This variant is seen much more commonly in individuals of either Cantonese or South African Black descent, in which it is the predominant allele. This suggests that the increased frequency of this variant may in part account for the increased HDL, decreased triglyceride levels, and decreased CAD observed in these populations compared to Caucasian populations.

[0241] Effect of Other cSNPs on Plasma Lipid Levels and Risk of CAD

[0242] No significant differences in lipid levels or CAD compared to respective non-carriers have been observed for carriers of the T774P (n=4), K776N (n=3) or E1172D variants (n=34). Carriers of the V825I (n=103 AB, 4 BB) had no obvious differences in lipid levels or baseline MSD or MOD. Carriers of the V825I variant did, however, have a significantly increased number of events during the trial (44% versus 33%, p=0.001).

[0243] No carriers of the S1731C variant were detected in the REGRESS population, but this variant was found in one of our FHA families (FHA2). In individuals heterozygous for ABC1 mutations, this variant was associated with significantly decreased HDL-C (0.16 Å 0.04, n=2 versus 0.64 Å 0.14, n=10; p=0.0009 in carriers versus non-carriers). In unaffected family members, however, while carriers of the S1731C had lower HDL-C compared to non-carriers (1.03 Å 0.22 versus 1.09 Å 0.23), this was not statistically significant. The control individual in whom this variant was also seen (FIG. 19) had low plasma HDL-C (0.72 mmol/L).

[0244] Similar to carriers of the R219K, carriers of the V771M variant (n=37), had no difference in HDL-C compared to non-carriers; however, a marginally significant interaction between age and genotype on HDL-C levels was noted (p=0.05). All but 2 carriers of the V771M variant are also carriers of the R219K variant. The interaction between age and genotype on HDL-C remains nearly significant when adjusted for R219K genotype (p=0.11), thus this variant may have an age effect independent of that which can be attributed to the R219K. A trend to less CAD (increased MOD) was observed in carriers of this variant compared to non-carriers (1.89 Å 0.38 versus 1.76 Å 0.35, p=0.13).

[0245] For the I883M cSNP, homozygous BB individuals (n=14) have increased progression in MOD (mean change of 0.53 Å 79 versus 0.11 Å 0.25, p<0.001). BB individuals had an events rate double that of the AA individuals (21.4% versus 10.6%) although this was not statistically significant (p=0.19). No difference was observed in mead lipid levels between the I883M genotypes. Furthermore, there were significantly more BB individuals than expected under Hardy-Weinberg equilibrium (14 BB, 86 AB, and 320 AA individuals observed compared to the expect 8 BB, 98 AB, and 314 M individuals). As this cohort was selected for individuals with CAD, this might suggest a preferential inclusion of those with the BB genotype. The association of this variant with CAD is further supported by the significantly increased frequency of this variant in the premature CAD population (odds ratio for CAD in carriers of this variant=0.43, 95% confidence interval 0.22-0.85, p=0.01) (FIG. 19). These findings contrast with those of a very recent report which suggests that the homozygous carriers of this cSNP have increased HDL-C (Wang et al., Arteriosclerosis, Thrombosis and Vascular Biology 20:1983-1989, 2000).

[0246] Carriers of the V399A (AB, n=9) had a trend to higher HDL-C (1.03 Å 0.28 versus 0.92 Å 0.23, p=0.15) compared to individuals who were AA at this site (n=540). No events were observed in the AB group (compared to 14% in M's, p=NS), and carriers had half the prevalence of a family history of CAD (22.2% versus 49.4%, p=0.18). Furthermore, consistent with this data, the carriers had a trend to increased baseline MOD (1.92 Å 0.32 versus 1.73 Å 0.35, p=0.13) and to less progression in MSD (−0.05A0.10 versus 0.08 Å 0.19, p=0.16) during the trial. However, as the number of carriers of small, firm conclusions regarding the relationship of this variant to increased HDL-C and decreased CAD cannot be drawn.

[0247] Carriers of the R1587K variant (AB, BB) have decreased HDL-C compared to non-carriers in an allele-dose dependent trend (0.86 Å 0.16, 0.91 Å 0.23, and 0.94 Å 0.23 for BB, AB, and M, respectively, p=0.03). No significant interaction with age was noted (p=0.32). Furthermore, on multiple regression including age, BMI, smoking, and triglyceride levels as covariates, the R1587K genotype is a significant predictor of HDL-C (p=0.027). However, no significant differences in CAD or events during the trial were evident in carriers compared to non-carriers.

[0248] Summary of Association Studies of ABC1 Gene Variants and HDL Levels or Cardiovascular Disease

[0249] The following polymporphisms have been examined for their effect on cholesterol regulation and the predisposition for the development of cardiovascular disease. The polymporphisms are numbered from the nucleotide described as position 1 (Pullinger et al., supra), naming the first exon number 1.

[0250] Substitution of A for G at nucleotide 1051 (R219K). Carriers of this variant have reduced triglyceride levels, increased HDL cholesterol levels (particularly in younger individuals), and reduced CAD.

[0251] Substitution of C for T at nucleotide 1591 (V399A). This variant was associated with a trend towards increased HDL cholesterol in carriers.

[0252] Substitution of A for G at nucleotide 2706 (V771M). Carriers of this variant have been shown to have decreased CAD.

[0253] Substitution of C for A at nucleotide 2715 (T774P). This variant was seen less often in individuals with low HDL cholesterol levels or CAD than in controls.

[0254] Substitution of C for G at nucleotide 2723 (K776N). This variant has been found at a lower frequency (0.54% versus 1.89%) in a coronary artery disease population versus a control population of similar Dutch background.

[0255] Substitution of C for G at nucleotide 3911 (E1172D). This variant is seen at lower frequencies in individuals with low HDL and in some populations with premature coronary artery disease.

[0256] Substitution of A for G at nucleotide 5155 (R1587K). This variant is associated with decreased HDL cholesterol levels in carriers.

[0257] Substitution of G for C at nucleotide 5587 (S1731C). Two FHA individuals who have this variant on the other allele have much lower HDL cholesterol (0.155 Å 0.025) than the FHA individuals in the family who do not have this variant on the other allele (0.64 Å 0.14, p=0.0009). This variant has also been found in one general population French Canadian control with HDL at the 8^(th) percentile (0.92) and one French Canadian individual from a population selected for low HDL cholesterol levels and coronary disease (0.72).

[0258] Substitution of G for A at nucleotide 2723 (I883M). This variant has been seen at a much higher frequency in individuals of Dutch ancestry with premature coronary artery disease. Furthermore, homozygous carriers of this variant have significantly increased CAD progression compared to non-carriers.

[0259] Substitution of A for G at nucleotide 2868 (V825I). Carriers of this variant had significantly more CAD events than individuals who do not have this variant.

[0260] Substitution of C for G at nucleotide −191. Homozygous carriers of this variant have a three-fold increase in the frequency of coronary events (33.3% versus 11.2%, p=0.003) and a nearly double frequency of a positive family history of CAD (73.3% versus 47.7%, p=0.01).

[0261] Substitution of G for C at nucleotide −17. Carriers of this variant have significantly decreased coronary events (12.3% versus 18.2%, p=0.04) and a significantly decreased incidence of myocardial infarction (heart attack, 43.6% versus 52.8%, p=0.02).

[0262] Substitution of T for C at nucleotide 69. Carriers of this variant have increased CAD progression compared to non-carriers.

[0263] Substitution of G for C at nucleotide 127. Carriers of this variant have a trend towards decreased progression of CAD compared to non-carriers.

[0264] Insertion of CCCT at nucleotide −1163 in intron 1. Carriers of this variant have a trend to lower HDL cholesterol levels.

[0265] Substitution of G for A at nucleotide −1095 in intron 1. Homozygous carriers of this variant have a trend towards decreased HDL cholesterol and increased triglyceride levels compared to non-carriers.

[0266] Substitution of A for G at nucleotide −1027 in intron 1. Carriers of this variant are also carriers of the G(−720)A. Thus the effects attributed to that variant may also be attributed to carriers of this variant.

[0267] Substitution of A for G at nucleotide −720 in intron 1. Homozygous carriers of this variant had a trend towards an increased frequency of a positive family history of myocardial infarction.

[0268] Substitution of C for A at nucleotide −461 in intron 1. Carriers of this variant are also carriers of the A(−362)G. Thus the effects attributed to that variant may also be attributed to carriers of this variant.

[0269] Substitution of G for A at nucleotide −362 in intron 1. Carriers of this variant have decreased triglyceride levels compared to non-carriers.

[0270] Insertion of G at nucleotide 319. Carriers of this variant have increased CAD compared to non-carriers.

[0271] Substitution of G for C at nucleotide 378. Carriers of this variant are also carriers of the InsG319. Thus the effects attributed to that variant may also be attributed to carriers of this variant.

[0272] Functional Role of LXRE Binding Sites in ABC1 Genomic Sequence

[0273] The functional role of three of the LXRE consensus binding sites identified in the ABC1 genomic sequence of SEQ ID NO: 1 was confirmed using standard gel shift assay experiments. Briefly, the sequences of the LXRE consensus binding sites located at 4389 and −1641 of the promoter region, +4 of exon 1, and −7670 and −7188 of 3′ intron 1 (FIG. 3) were tested in gel shift assays using the cyp7 LXRE described by Lehmann et al. as a positive control (Lehmann et al., J. Biol. Chem. 272:3137-3140, 1997).

[0274] In the first gel shift assay, binding of the labeled cyp7 LXRE probe was competed by a 400-fold excess of cold cyp7 probe, 4389 probe, −1641 probe, +4 probe, or −7670 probe. The signal disappeared completely in all cases.

[0275] For the next assay, the 4389, −1641, +4, −7670 and −7188 probes were labeled and binding to LXR-RXR complex was assayed. A signal similar to that of the positive control was observed for each of ABC1 LXRE probes, although the signal was slightly more intense for the +4 probe.

[0276] A competition assay was also performed using lesser quantities of each cold probe, i.e. 5-fold, 25-fold, and 50-fold more cold probe than labeled cyp7 probe. There was a dose dependant decrease in the signal for each of the probes. This decrease was more significant for the +4 and −7670 probes. Moreover, the signal was not modified by competition with a cold Dr2 like probe, suggesting that the competition effect is indeed specific.

[0277] Thus, each of the tested potential LXRE binding sites seem to bind an in vitro LXR-RXR heterodimer. The LXRE binding site at +4 in exon 1 appears to have the highest affinity, closely followed by the LXRE binding site at −7670 in 3′ intron 1.

[0278] Agonists and Antagonists

[0279] Useful therapeutic compounds include those which modulate the expression, activity, or stability of ABC1. To isolate such compounds, ABC1 expression, biological activity, or regulated catabolism is measured following the addition of candidate compounds to a culture medium of ABC1-expressing cells. Alternatively, the candidate compounds may be directly administered to animals (for example mice, pigs, or chickens) and used to screen for their effects on ABC1 expression.

[0280] In addition its role in the regulation of cholesterol, ABC1 also participates in other biological processes for which the development of ABC1 modulators would be useful. In one example, ABC1 transports interleukin-1β (IL-1β) across the cell membrane and out of cells. IL-1β is a precursor of the inflammatory response and, as such, inhibitors or antagonists of ABC1 expression or biological activity may be useful in the treatment of any inflammatory disorders, including but not limited to rheumatoid arthritis, systemic lupus erythematosis (SLE), hypo- or hyper-thyroidism, inflammatory bowel disease, and diabetes mellitus. In another example, ABC1 expressed in macrophages has been shown to be engaged in the engulfment and clearance of dead cells. The ability of macrophages to ingest these apoptotic bodies is impaired after antibody-mediated blockade of ABC1. Accordingly, compounds that modulate ABC1 expression, stability, or biological activity would be useful for the treatment of these disorders.

[0281] ABC1 expression is measured, for example, by standard Northern blot analysis using an ABC1 nucleic acid sequence (or fragment thereof as a hybridization probe, or by Western blot using an anti-ABC1 antibody and standard techniques. The level of ABC1 expression in the presence of the candidate molecule is compared to the level measured for the same cells, in the same culture medium, or in a parallel set of test animals, but in the absence of the candidate molecule. ABC1 activity can also be measured using the cholesterol efflux assay.

[0282] Transcriptional Regulation of ABC1 Expression

[0283] ABC1 mRNA is increased approximately 8-fold upon cholesterol loading. This increase is likely controlled at the transcriptional level. Using the genomic sequence described herein, one can identify transcription factors that bind to the 5′ regulatory sequence by performing, for example, gel shift assays, DNAse protection assays, or in vitro or in vivo reporter gene-based assays. The identified transcription factors are themselves drug targets. In the case of ABC1, drug compounds that act through modulation of transcription of ABC1 could be used for HDL modulation, triglyceride modulation, atherosclerosis prevention, and the treatment of cardiovascular disease. For example, using a compound to inhibit a transcription factor that represses ABC1 would be expected to result in up-regulation of ABC1 and, therefore, up-regulation of HDL cholesterol levels and down-regulation of triglyceride levels. In another example, a compound that increases transcription factor expression or activity would also increase ABC1 expression, increase HDL levels, and decrease triglyceride levels.

[0284] Transcription factors known to regulate other genes in the regulation of apolipoprotein genes or other cholesterol- or lipid-regulating genes are of particular relevance. Such factors include, but are not limited to, the steroid response element binding proteins (SREBP-1 and SREBP-2), and the PPAR (peroxisomal proliferation-activated receptor), RXR, and LXR transcription factors. Several consensus sites for certain elements are present in the sequenced region 5′ to the ABC1 gene (FIG. 3) and thus are likely to modulate ABC1 expression. For example, LXRs may alter transcription of ABC1 by mechanisms including heterodimerization with retinoid X receptors (RXRs) and then binding to specific response elements (LXREs). Examples of such LXRs include LXRα and LXRβ. Compounds that modulate LXR-mediated transcriptional activation are likely to modulate ABC1 gene expression and thus are useful for modulating HDL cholesterol levels and triglyceride levels. Janowski et al. (Proc. Natl. Acad. Sci. USA 96:266-271, 1999) described the role of naturally occurring oxysterols in LXR-dependent transactivation through the promoter for cholesterol 7a-hydoxylase (Cyp7a), which is the rate limiting enzyme in bile acid synthesis. Janowski further demonstrated that oxysterols bind directly to LXRs. The position specific mono-oxidation of the sterol side chain is required for LXR high affinity binding and activation. Enhanced binding could be achieved by use of 24-oxo ligands. Oxygens at more than one carbon on the side chain of cholesterol diminished LXR binding and activation as compared to monoxygenated analogs. LXR ligands were found to require a single stereoselective oxygen on the sterol side chain that functioned as a hydrogen acceptor. Introduction of dimethylamide exhibited the greatest binding and activation compared to an ester or carbonyl group.

[0285] Compounds known to modulate LXR activity include, without limitation, 24-(S),25-epoxycholesterol; 24(S)-hydroxycholesterol; 22-(R)-hydroxycholesterol; 24(R),25-epoxycholesterol; 22(R)-hydroxy-24(S),25-epoxycholesterol; 22(S)-hydroxy-24(R),25-epoxycholesterol; 24-(S),25-iminocholesterol; methyl-38-hydroxycholonate; N,N-dimethyl-3β-hydroxycholonamide; 24(R)-hydroxycholesterol; 22(S)-hydroxycholesterol; 22(R),24(S)-dihydroxycholesterol; 25-hydroxycholesterol; 22(R)-hydroxycholesterol; 22(S)-hydroxycholesterol; 24(S),25-dihydroxycholesterol; 24(R),25-dihydroxycholesterol; 24,25-dehydrocholesterol; 25-epoxy-22(R)-hydroxycholesterol; 20(S)-hydroxycholesterol; (20R,22R)-cholest-5-ene-3β,20,22-triol; 4,4-dimethyl-5-α-cholesta-8,14,24-trien-3-β-ol; 7α-hydroxy-24(S),25-epoxycholesterol; 7β-hydroxy-24(S),25-epoxycholesterol; 7-oxo-24(S),25-expoxycholesterol; 7α-hydroxycholesterol; 7-oxocholesterol; and desmosterol. Additional LXR-modulating compounds are described, for example, in Janowski et al., Nature 383:728-731, 1996; Lehman et al., J. Biol. Chem. 272:3137-3140, 1997; and Janowski et al., Proc. Natl. Acad. Sci. 96:266-271, 1998, each of which is hereby incorporated by reference). In addition one in the art will recognize that synthetic sterols having LXR-modulating activity can be readily identified using screening methods known in the art (see, for example, Janowski et al., Proc. Natl. Acad. Sci. 96:266-271,1998). Non-steroidal agonists such as RIP140 protein, antibodies (monoclonal or polyclonal) specific for LXRA or LXRP; tetradecycloxy-furnacarboxylic acid (TOFA;); tetradecylthioacetic acid; as well as other fatty acids (see, for example, Tobin et al. Molec. Endocrin. 14: 741-752, 2000) are also useful LXR-modulating agents.

[0286] Additional transcription factors which may also have an effect in modulating ABC1 gene expression and thereby HDL levels, triglyceride levels, atherosclerosis, and CAD risk include REV-ERBα, SREBP-1 & 2, ADD-1, EBPα, CREB binding protein, P300, HNF 4, RAR, and RORα. Exemplary binding sites are depicted in FIG. 3. Additional binding sites for these factors can be found, for example, through examination of the sequence in SEQ ID NO: 1.

[0287] RXR heterodimerizes with many nuclear receptors, including LXR, and aids in transactivating the target gene. Thus, compounds that modulate RXR-mediated transcriptional activity will also modulate ABC1 expression. Numerous RXR-modulating compounds (rexinoid compounds) are known in the art, including, for example, hetero ethylene derivatives; tricyclic retinoids; trienoic retinoids; benzocycloalkenyl-alka:di- or trienoic acid derivatives; bicyclic-aromatic compounds and their derivatives; bicyclylmethyl-aryl acid derivatives; phenyl-methyl heterocyclic compounds; tetrahydro-napthyl compounds; arylthio-tetrahydro-naphthalene derivatives and heterocyclic analogues; 2,4-pentadienoic acid derivatives; tetralin-based compounds; nonatetraenoic acid derivatives; SR11237; dexamethasone; hydroxy, epoxy, and carboxy derivatives of methoprene; bicyclic benzyl, pyridinyl, thiophene, furanyl, and pyrrole derivatives; benzofuran-acrylic acid derivatives; aryl-substituted and aryl and (3-oxo-1-propenly)-substituted benzopyran, benzothiopyran, 1,2-dihydroquinoline, and 5,6-dihydronaphthalene derivatives; vitamin D3 (1,25-dihydroxyvitamin D3) and analogs; 24-hydroxylase inhibitor; mono-or polyenic carboxylic acid derivatives; tetrahdroquinolin-2-one-6 or 7-yl and related derivatives; tetrahydronaphthalene; oxyiminoalkanoic acid derivatives; LG 100268; and LGD 1069. Additional compounds include BRL 49653; troglitazone; pioglitazone; ciglitazone; WAY-120; englitazone; AD 5075; and darglitazone.

[0288] PPARs may alter transcription of ABC1 by mechanisms including heterodimerization with retinoid X receptors (RXRs) and then binding to specific proliferator response elements (PPREs). Examples of such PPARs include PPARα, β, γ and δ. These distinct PPARs have been shown to have transcriptional regulatory effects on different genes. PPARα is expressed mainly in liver, whereas PPARγ is expressed in predominantly in adipocytes. Both PPARα and PPARγ are found in coronary and carotid artery atherosclerotic plaques and in endothelial cells, smooth muscle cells, monocytes and monocyte-derived macrophages. Activation of PPARα results in altered lipoprotein metabolism through PPARα's effect on genes such as lipoprotein lipase (LPL), apolipoprotein CIII (apo CIII) and apolipoprotein AI (apo AI) and AII (apo AII). PPARα activation results in overexpression of LPL and apoA-I and apoA-II, but inhibits the expression of apo CII. PPARα activation also inhibits inflammation, stimulates lipid oxidation and increases the hepatic uptake and esterification of free fatty acids (FFA's). PPARα and PPARγ activation may inhibit nitric oxide (NO) synthase in macrophages and prevent interleukin-1 (IL-1) induced expression of IL-6 and cyclo-oxygenase-2 (COX-2) and thrombin induced endothelin-1 expression secondary to negative transcriptional regulation of NF-KB and activation of protein-1 signaling pathway. It has also been shown that PPARα induces apoptosis in monocyte-derived macrophages through the inhibition of NF-KB activity.

[0289] Activation of PPARα can be achieved by compounds such as fibrates, β-estradiol, arachidonic acid derivatives, WY-14,643 and LTB4 or 8(s)HETE. PPARy activation can be achieved through compounds such as thiozolidinedione antidiabetic drugs, 9-HODE and 13-HODE. Additional compounds such as nicotinic acid or HMG CoA reductase inhibitors may also alter the activity of PPARs.

[0290] Compounds which alter activity of any of the PPARs (e.g., PPARα or PPARγ) may have an effect on ABC1 expression and thereby could affect HDL levels, triglyceride levels, atherosclerosis, and risk of CAD. PPARs are also regulated by fatty acids (including modified fatty acids such as 3 thia fatty acids), leukotrienes such as leukotriene B4 and prostaglandin J2, which is a natural activator/ligand for PPARγ. Drugs that modulate PPARs may therefore have an important effect on modulating lipid levels (including HDL and triglyceride levels) and altering CAD risk. This effect could be achieved through the modulation of ABC1 gene expression. Drugs may also effect ABC1 gene expression and thereby HDL and triglyceride levels, by an indirect effect on PPARs via other transcriptional factors such as adipocyte differentiation and determination factor-1 (ADD-1) and sterol regulatory element binding protein-1 and 2 (SREBP-1 and 2). Drugs with combined PPARα and PPARγ agonist activity or PPARα and PPARγ agonists given in combination for example, may increase HDL levels or decrease triglyceride levels even more.

[0291] A PPAR binding site (PPRE element) is found 5′ to the ABC1 gene (FIG. 3). Like the PPRE elements found in the C-ACS, HD, CYP4A6 and ApoA-1 genes, this PPRE site is a trimer related to the PPRE consensus sequence. Partly because of its similarity in the number and arrangement of repeats in this PPAR binding site, this element in particular is very likely to be of physiological relevance to the regulation of the ABC1 gene.

[0292] Additional Utility of ABC1 Polypeptides, Nucleic Acids, and Modulators

[0293] ABC1 may act as a transporter of toxic proteins or protein fragments (e.g., APP) out of cells. Thus, ABC1 agonists/upregulators may be useful in the treatment of other disease areas, including Alzheimer's disease, Niemann-Pick disease, and Huntington's disease.

[0294] ABC transporters have been shown to increase the uptake of long chain fatty acids from the cytosol to peroxisomes and, moreover, to play a role in β-oxidation of very long chain fatty acids. Importantly, in x-linked adrenoleukodystrophy (ALD), fatty acid metabolism is abnormal, due to defects in the peroxisomal ABC transporter. Any agent that upregulates ABC transporter expression or biological activity may therefore be useful for the treatment of ALD or any other lipid disorder.

[0295] ABC1 is expressed in macrophages and is required for engulfment of cells undergoing programmed cell death. The apoptotic process itself, and its regulation, have important implications for disorders such as cancer, one mechanism of which is failure of cells to undergo cell death appropriately. ABC1 may facilitate apoptosis, and as such may represent an intervention point for cancer treatment. Increasing ABC1 expression or activity or otherwise up-regulating ABC1 by any method may constitute a treatment for cancer by increasing apoptosis and thus potentially decreasing the aberrant cellular proliferation characterized by this disease. Conversely, down-regulation of ABC1 by any method may provide opportunity for decreasing apoptosis and allowing increased proliferation of cells in conditions where cell growth is limited. Such disorders include but are not limited to neurodeficiencies and neurodegeneration, and growth disorders. ABC1 could, therefore, be used as a method for identification of compounds for use in the treatment of cancer, or in the treatment of degenerative disorders.

[0296] Agents that have been shown to inhibit ABC1 include, for example, the anti-diabetic agents glibenclamide and glyburide, flufenamic acid, diphenylamine-2-carbonic acid, sulfobromophthalein, and DIDS.

[0297] Agents that upregulate ABC1 expression or biological activity include but are not limited to protein kinase A, protein kinase C, vanadate, okadaic acid, and IBMX1.

[0298] Those in the art will recognize that other compounds can also modulate ABC1 biological activity, and these compounds are also in the spirit of the invention.

[0299] Drug Screens Based on the ABC1 Regulatory Regions, Gene, or Protein

[0300] The ABC1 protein and gene can be used in screening assays for identification of compounds which modulate its activity and may be potential drugs to regulate cholesterol or triglyceride levels. The ABC1 5′ regulatory sequence and other regulatory regions (e.g., exon 1 and exon 2) can be used in screening assays for identification of compounds which modulate ABC1 expression and may be potential drugs to regulate lipid levels, including, for example, HDL-C, LDL-C, and triglycerides. Drug screens to identify compounds that modulate ABC1 expression may employ an ABC1 regulatory region operably linked to ABC1. Preferably, however, the regulatory region is operably linked to a reporter gene (e.g., a gene encoding GFP, chloramphenicol acetyltransferase, or beta-galactosidase).

[0301] Useful ABC1 proteins include wild-type and mutant ABC1 proteins or protein fragments, in a recombinant form or endogenously expressed. Drug screens to identify compounds acting on the ABC1 expression product may employ any functional feature of the protein. In one example, the phosphorylation state or other post-translational modification is monitored as a measure of ABC1 biological activity. ABC1 has ATP binding sites, and thus assays may wholly or in part test the ability of ABC1 to bind ATP or to exhibit ATPase activity. ABC1, by analogy to similar proteins, is thought to be able to form a channel-like structure; drug screening assays could be based upon assaying for the ability of the protein to form a channel, or upon the ability to transport cholesterol or another molecule, or based upon the ability of other proteins bound by or regulated by ABC1 to form a channel. Alternatively, phospholipid or lipid transport can also be used as measures of ABC1 biological activity.

[0302] There is evidence that, in addition to its role as a regulator of cholesterol levels, ABC1 also transports anions. Functional assays could be based upon this property, and could employ drug screening technology such as (but not limited to) the ability of various dyes to change color in response to changes in specific ion concentrations in such assays can be performed in vesicles such as liposomes, or adapted to use whole cells.

[0303] Drug screening assays can also be based upon the ability of ABC1 or other ABC transporters to interact with other proteins. Such interacting proteins can be identified by a variety of methods known in the art, including, for example, radioimmunoprecipitation, co-immunoprecipitation, co-purification, and yeast two-hybrid screening. Such interactions can be further assayed by means including but not limited to fluorescence polarization or scintillation proximity methods. Drug screens can also be based upon functions of the ABC1 protein deduced upon X-ray crystallography of the protein and comparison of its 3-D structure to that of proteins with known functions. Such a crystal structure has been determined for the prokaryotic ABC family member H is P, histidine permease. Drug screens can be based upon a function or feature apparent upon creation of a transgenic or knockout mouse, or upon overexpression of the protein or protein fragment in mammalian cells in vitro. Moreover, expression of mammalian (e.g., human) ABC1 in yeast or C. elegans allows for screening of candidate compounds in wild-type and mutant backgrounds, as well as screens for mutations that enhance or suppress an ABC1-dependent phenotype. Modifier screens can also be performed in ABC1 transgenic or knock-out mice.

[0304] Additionally, drug screening assays can also be based upon ABC1 functions deduced upon antisense interference with the gene function. Intracellular localization of ABC1, or effects which occur upon a change in intracellular localization of the protein, can also be used as an assay for drug screening. Immunocytochemical methods will be used to determine the exact location of the ABC1 protein.

[0305] Human and rodent ABC1 protein can be used as an antigen to raise antibodies, including monoclonal antibodies. Such antibodies will be useful for a wide variety of purposes, including but not limited to functional studies and the development of drug screening assays and diagnostics. Monitoring the influence of agents (e.g., drugs, compounds) on the expression or biological activity of ABC1 can be applied not only in basic drug screening, but also in clinical trials. For example, the effectiveness of an agent determined by a screening assay as described herein to increase ABC1 gene expression, protein levels, or biological activity can be monitored in clinical trails of subjects exhibiting altered ABC1 gene expression, protein levels, or biological activity. Alternatively, the effectiveness of an agent determined by a screening assay to modulate ABC1 gene expression, protein levels, or biological activity can be monitored in clinical trails of subjects exhibiting decreased altered gene expression, protein levels, or biological activity. In such clinical trials, the expression or activity of ABC1 and, preferably, other genes that have been implicated in, for example, cardiovascular disease can be used to ascertain the effectiveness of a particular drug.

[0306] For example, and not by way of limitation, genes, including ABC1, that are modulated in cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates ABC1 biological activity (e.g., identified in a screening assay as described herein) can be identified. Thus, to study the effect of agents on cholesterol levels, triglyceride levels, or cardiovascular disease, for example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of ABC1 and other genes implicated in the disorder. The levels of gene expression can be quantified by Northern blot analysis or RT-PCR, or, alternatively, by measuring the amount of protein produced, by one of a number of methods known in the art, or by measuring the levels of biological activity of ABC1 or other genes. In this way, the gene expression can serve as a marker, indicative of the physiological response of the cells to the agent. Accordingly, this response state may be determined before, and at various points during, treatment of the individual with the agent.

[0307] In a preferred embodiment, the present invention provides a method for monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate identified by the screening assays described herein) including the steps of (i) obtaining a pre-administration sample from a subject prior to administration of the agent; (ii) detecting the level of expression of an ABC1 protein, mRNA, or genomic DNA in the preadministration sample; (iii) obtaining one or more post-administration samples from the subject; (iv) detecting the level of expression or activity of the ABC1 protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing the level of expression or activity of the ABC1 protein, mRNA, or genomic DNA in the pre-administration sample with the ABC1 protein, mRNA, or genomic DNA in the post administration sample or samples; and (vi) altering the administration of the agent to the subject accordingly. For example, increased administration of the agent may be desirable to increase the expression or activity of ABC1 to higher levels than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased administration of the agent may be desirable to decrease expression or activity of ABC1 to lower levels than detected.

[0308] The ABC1 gene or a fragment thereof can be used as a tool to express the protein in an appropriate cell in vitro or in vivo (gene therapy), or can be cloned into expression vectors which can be used to produce large enough amounts of ABC1 protein to use in in vitro assays for drug screening. Expression systems which may be employed include baculovirus, herpes virus, adenovirus, adeno-associated virus, bacterial systems, and eucaryotic systems such as CHO cells. Naked DNA and DNA-liposome complexes can also be used.

[0309] Assays of ABC1 activity includes binding to intracellular interacting proteins; interaction with a protein that up-regulates ABC1 activity; interaction with HDL particles or constituents; interaction with other proteins which facilitate interaction with HDL or its constituents; and measurement of cholesterol efflux. Furthermore, assays may be based upon the molecular dynamics of macromolecules, metabolites and ions by means of fluorescent-protein biosensors. Alternatively, the effect of candidate modulators on expression or activity may be measured at the level of ABC1 protein production using the same general approach in combination with standard immunological detection techniques, such as Western blotting or immunoprecipitation with an ABC1-specific antibody. Again, useful cholesterol- or triglyceride-regulating or anti-CVD therapeutic modulators are identified as those which produce an change in ABC1 polypeptide production. Agonists may also affect ABC1 activity without any effect on expression level.

[0310] Candidate modulators may be purified (or substantially purified) molecules or may be one component of a mixture of compounds (e.g., an extract or supernatant obtained from cells). In a mixed compound assay, ABC1 expression is tested against progressively smaller subsets of the candidate compound pool (e.g., produced by standard purification techniques, e.g., HPLC or FPLC; Ausubel et al.) until a single compound or minimal compound mixture is demonstrated to modulate ABC1 expression.

[0311] Agonists, antagonists, or mimetics found to be effective at modulating the level of cellular ABC1 expression or activity may be confirmed as useful in animal models (for example, mice, pigs, rabbits, or chickens). For example, the compound may ameliorate the low HDL levels of mouse or chicken hypoalphalipoproteinemias or may lower the triglyceride levels in animal models.

[0312] A compound that promotes an increase in ABC1 expression or activity is considered particularly useful in the invention; such a molecule may be used, for example, as a therapeutic to increase the level or activity of native, cellular ABC1 and thereby treat a low HDL or high triglyceride condition in an animal (for example, a human). If desired, treatment with an agonist of the invention may be combined with any other HDL-raising, triglyceride-lowering, or anti-CVD therapies.

[0313] One method for increasing ABC biological activity is to increase the stabilization of the ABC protein or to prevent its degradation. Thus, it would be useful to identify mutations in an ABC polypeptide (e.g., ABC1) that lead to increased protein stability. These mutations can be incorporated into any protein therapy or gene therapy undertaken for the treatment of low HDL-C or any other condition resulting from loss of ABC1 biological activity. Similarly, compounds that increase the stability of a wild-type ABC polypeptide or decrease its catabolism may also be useful for the treatment of low HDL-C or any other condition resulting from loss of ABC1 biological activity. Such mutations and compounds can be identified using the methods described herein.

[0314] In one example, cells expressing an ABC polypeptide having a mutation are transiently metabolically labeled during translation and the half-life of the ABC polypeptide is determined using standard techniques. Mutations that increase the half-life of an ABC polypeptide are ones that increase ABC protein stability. These mutations can then be assessed for ABC biological activity. They can also be used to identify proteins that affect the stability of ABC1 mRNA or protein. One can then assay for compounds that act on these factors or on the ability of these factors to bind ABC1.

[0315] In another example, cells expressing wild-type ABC polypeptide are transiently metabolically labeled during translation, contacted with a candidate compounds, and the half-life of the ABC polypeptide is determined using standard techniques. Compounds that increase the half-life of an ABC polypeptide are useful compounds in the present invention.

[0316] It is understood that, while ABC1 is the preferred ABC transporter for the drug screens described herein, other ABC transporters can also be used. The replacement of ABC1 with another ABC transporter is possible because it is likely that ABC transporter family members, such as ABC2, ABCR, or ABC8 will have a similar mechanism of regulation.

[0317] Exemplary assays are described in greater detail below.

[0318] Protein-Based Assays

[0319] ABC1 polypeptide (purified or unpurified) can be used in an assay to determine its ability to bind another protein (including, but not limited to, proteins found to specifically interact with ABC1). The effect of a compound on that binding is then determined.

[0320] Protein Interaction Assays

[0321] ABC1 protein (or a polypeptide fragment thereof or an epitope-tagged form or fragment thereof) is harvested from a suitable source (e.g., from a prokaryotic expression system, eukaryotic cells, a cell-free system, or by immunoprecipitation from ABC1-expressing cells). The ABC1 polypeptide is then bound to a suitable support (e.g., nitrocellulose or an antibody or a metal agarose column in the case of, for example, a his-tagged form of ABC1). Binding to the support is preferably done under conditions that allow proteins associated with ABC1 polypeptide to remain associated with it. Such conditions may include use of buffers that minimize interference with protein-protein interactions. The binding step can be done in the presence and absence of compounds being tested for their ability to interfere with interactions between ABC1 and other molecules. If desired, other proteins (e.g., a cell lysate) are added, and allowed time to associate with the ABC polypeptide. The immobilized ABC1 polypeptide is then washed to remove proteins or other cell constituents that may be non-specifically associated with it the polypeptide or the support. The immobilized ABC1 polypeptide is then dissociated from its support, and so that proteins bound to it are released (for example, by heating), or, alternatively, associated proteins are released from ABC1 without releasing the ABC1 polypeptide from the support. The released proteins and other cell constituents can be analyzed, for example, by SDS-PAGE gel electrophoresis, Western blotting and detection with specific antibodies, phosphoamino acid analysis, protease digestion, protein sequencing, or isoelectric focusing. Normal and mutant forms of ABC1 can be employed in these assays to gain additional information about which part of ABC1 a given factor is binding to. In addition, when incompletely purified polypeptide is employed, comparison of the normal and mutant forms of the protein can be used to help distinguish true binding proteins.

[0322] The foregoing assay can be performed using a purified or semipurified protein or other molecule that is known to interact with ABC1. This assay may include the following steps.

[0323] 1. Harvest ABC1 protein and couple a suitable fluorescent label to it;

[0324] 2. Label an interacting protein (or other molecule) with a second, different fluorescent label. Use dyes that will produce different quenching patterns when they are in close proximity to each other versus when they are physically separate (i.e., dyes that quench each other when they are close together but fluoresce when they are not in close proximity);

[0325] 3. Expose the interacting molecule to the immobilized ABC1 in the presence or absence of a compound being tested for its ability to interfere with an interaction between the two; and

[0326] 4. Collect fluorescent readout data.

[0327] Another assay is includes Fluorescent Resonance Energy Transfer (FRET) assay. This assay can be performed as follows.

[0328] 1. Provide ABC1 protein or a suitable polypeptide fragment thereof and couple a suitable FRET donor (e.g., nitro-benzoxadiazole (NBD)) to it;

[0329] 2. Label an interacting protein (or other molecule) with a FRET acceptor (e.g., rhodamine);

[0330] 3. Expose the acceptor-labeled interacting molecule to the donor-labeled ABC1 in the presence or absence of a compound being tested for its ability to interfere with an interaction between the two; and

[0331] 4. Measure fluorescence resonance energy transfer.

[0332] Quenching and FRET assays are related. Either one can be applied in a given case, depending on which pair of fluorophores is used in the assay.

[0333] Membrane Permeability Assay

[0334] The ABC1 protein can also be tested for its effects on membrane permeability. For example, beyond its putative ability to translocate lipids, ABC1 might affect the permeability of membranes to ions. Other related membrane proteins, most notably the cystic fibrosis transmembrane conductance regulator and the sulfonylurea receptor, are associated with and regulate ion channels.

[0335] ABC1 or a fragment of ABC1 is incorporated into a synthetic vesicle, or, alternatively, is expressed in a cell and vesicles or other cell sub-structures containing ABC1 are isolated. The ABC1-containing vesicles or cells are loaded with a reporter molecule (such as a fluorescent ion indicator whose fluorescent properties change when it binds a particular ion) that can detect ions (to observe outward movement), or alternatively, the external medium is loaded with such a molecule (to observe inward movement). A molecule which exhibits differential properties when it is inside the vesicle compared to when it is outside the vesicle is preferred. For example, a molecule that has quenching properties when it is at high concentration but not when it is at another low concentration would be suitable. The movement of the charged molecule (either its ability to move or the kinetics of its movement) in the presence or absence of a compound being tested for its ability to affect this process can be determined.

[0336] In another assay, membrane permeability is determined electro-physiologically by measuring ionic influx or efflux mediated by or modulated by ABC1 by standard electrophysiological techniques. A suitable control (e.g., TD cells or a cell line with very low endogenous ABC1 expression) can be used as a control in the assay to determine if the effect observed is specific to cells expressing ABC1.

[0337] In still another assay, uptake of radioactive isotopes into or out of a vesicle can be measured. The vesicles are separated from the extravesicular medium and the radioactivity in the vesicles and in the medium is quantitated and compared.

[0338] Nucleic Acid-Based Assays

[0339] ABC1 nucleic acid may be used in an assay based on the binding of factors necessary for ABC1 gene transcription. The association between the ABC1 DNA and the binding factor may be assessed by means of any system that discriminates between protein-bound and non-protein-bound DNA (e.g., a gel retardation assay). The effect of a compound on the binding of a factor to ABC1 DNA is assessed by means of such an assay. In addition to in vitro binding assays, in vivo assays in which the regulatory regions of the ABC1 gene are linked to reporter genes can also be performed.

[0340] Assays Measuring ABC1 Stability

[0341] A cell-based or cell-free system can be used to screen for compounds based on their effect on the half-life of ABC1 mRNA or ABC1 protein. The assay may employ labeled mRNA or protein. Alternatively, ABC1 mRNA may be detected by means of specifically hybridizing probes or a quantitative PCR assay. Protein can be quantitated, for example, by fluorescent antibody-based methods.

[0342] In Vitro mRNA Stability Assay

[0343] 1. Isolate or produce, by in vitro transcription, a suitable quantity of ABC1 mRNA;

[0344] 2. Label the ABC1 mRNA;

[0345] 3. Expose aliquots of the mRNA to a cell lysate in the presence or absence of a compound being tested for its ability to modulate ABC1 mRNA stability; and

[0346] 4. Assess intactness of the remaining mRNA at suitable time points.

[0347] In Vitro Protein Stability Assay

[0348] 1. Express a suitable amount of ABC1 protein;

[0349] 2. Label the protein;

[0350] 3. Expose aliquots of the labeled protein to a cell lysate in the presence or absence of a compound being tested for its ability to modulate ABC1 protein stability; and

[0351] 4. Assess intactness of the remaining protein at suitable time points

[0352] In Vivo mRNA or Protein Stability Assay

[0353] 1. Incubate cells expressing ABC1 mRNA or protein with a tracer (radiolabeled ribonucleotide or radiolabeled amino acid, respectively) for a very brief time period (e.g., five minutes) in the presence or absence of a compound being tested for its effect on mRNA or protein stability;

[0354] 2. Incubate with unlabeled ribonucleotide or amino acid; and

[0355] 3. Quantitate the ABC1 mRNA or protein radioactivity at time intervals beginning with the start of step 2 and extending to the time when the radioactivity in ABC1 mRNA or protein has declined by approximately 80%. It is preferable to separate the intact or mostly intact mRNA or protein from its radioactive breakdown products by a means such as gel electrophoresis in order to quantitate the mRNA or protein.

[0356] Assays Measuring Inhibition of Dominant Negative Activity

[0357] Mutant ABC1 polypeptides are likely to have dominant negative activity (i.e., activity that interferes with wild-type ABC1 function). An assay for a compound that can interfere with such a mutant may be based on any method of quantitating normal ABC1 activity in the presence of the mutant. For example, normal ABC1 facilitates cholesterol efflux, and a dominant negative mutant would interfere with this effect. The ability of a compound to counteract the effect of a dominant negative mutant may be based on cellular cholesterol efflux, or on any other normal activity of the wild-type ABC1 that was inhibitable by the mutant.

[0358] Assays Measuring Phosphorylation

[0359] Glu89 in the wild-type chicken ABC1 polypeptide is likely to be part of a phosphorylation motif, and thus elimination of this phosphorylation motif by the E_K ABC1 mutation in the WHAM chicken (discussed further below) may be responsible for reduced biological activity of WHAM chicken ABC1. Thus, compounds that modulate the phosphorylation state of ABC1 are likely to be clinically relevant modulators of human ABC1 activity.

[0360] The effect of a compound on ABC1 phosphorylation can be assayed by methods that quantitate phosphates on proteins or that assess the phosphorylation state of a specific residue of a ABC1. Such methods include but are not limited to ³²p labeling and immunoprecipitation, detection with antiphosphoamino acid antibodies (e.g., antiphosphoserine antibodies), phosphoamino acid analysis on 2-dimensional TLC plates, and protease digestion fingerprinting of proteins followed by detection of ³²P-labeled fragments.

[0361] Assays Measuring Other Post-Translational Modifications

[0362] The effect of a compound on the post-translational modification of ABC1 is based on any method capable of quantitating that particular modification. For example, effects of compounds on glycosylation may be assayed by treating ABC1 with glycosylase and quantitating the amount and nature of carbohydrate released.

[0363] Assays Measuring ATP Binding

[0364] The ability of ABC1 to bind ATP provides another assay to screen for compounds that affect ABC1. ATP binding can be quantitated as follows.

[0365] 1. Provide ABC1 protein at an appropriate level of purity and reconstitute it in a lipid vesicle;

[0366] 2. Expose the vesicle to a labeled but non-hydrolyzable ATP analog (such as gamma ³⁵S-ATP) in the presence or absence of compounds being tested for their effect on ATP binding. Note that azido-ATP analogs can be used to allow covalent attachment of the azido-ATP to protein (by means of UV light), and permit easier quantitation of the amount of ATP bound to the protein; and

[0367] 3. Quantitate the amount of ATP analog associated with ABC1

[0368] Assays Measuring ATPase Activity

[0369] Quantitation of the ATPase activity of ABC1 can also be assayed for the effect of compounds on ABC1. This is preferably performed in a cell-free assay so as to separate ABC1 from the many other ATPases in the cell. An ATPase assay may be performed in the presence or absence of membranes, and with or without integration of ABC1 protein into a membrane. If performed in a vesicle-based assay, the ATP hydrolysis products produced or the ATP hydrolyzed may be measured within or outside of the vesicles, or both. Such an assay may be based on disappearance of ATP or appearance of ATP hydrolysis products.

[0370] For high-throughput screening, a coupled ATPase assay is preferable. For example, a reaction mixture containing pyruvate kinase and lactate dehydrogenase can be used. The mixture includes phosphoenolpyruvate (PEP), nicotinamide adenine dinucleotide (NAD⁺), and ATP. The ATPase activity of ABC1 generates ADP from ATP. The ADP is then converted back to ATP as part of the pyruvate kinase reaction. The product, pyruvate, is then converted to lactate. The latter reaction generates a colored quinone (NADH) from a colorless substrate (NAD+), and the entire reaction can be monitored by detection of the color change upon formation of NADH. Since ADP is limiting for the pyruvate kinase reaction, this coupled system precisely monitors the ATPase activity of ABC1.

[0371] Assays Measuring Cholesterol Efflux

[0372] A transport-based assay can be performed in vivo or in vitro. For example, the assay may be based on any part of the reverse cholesterol transport process that is readily re-created in culture, such as cholesterol or phospholipid efflux. Alternatively, the assay may be based on net cholesterol transport in a whole organism, as assessed by means of a labeled substance (such as cholesterol).

[0373] For high throughput, fluorescent lipids can be used to measure ABC1-catalyzed lipid efflux. For phospholipids, a fluorescent precursor, C6-NBD-phosphatidic acid, can be used. This lipid is taken up by cells and dephosphorylated by phosphatidic acid phosphohydrolase. The product, NBD-diglyceride, is then a precursor for synthesis of glycerophospholipids like phosphatidylcholine. The efflux of NBD-phosphatidylcholine can be monitored by detecting fluorescence resonance energy transfer (FRET) of the NBD to a suitable acceptor in the cell culture medium. This acceptor can be rhodamine-labeled phosphatidylethanolamine, a phospholipid that is not readily taken up by cells. The use of short-chain precursors obviates the requirement for the phospholipid transfer protein in the media. For cholesterol, NBD-cholesterol ester can be reconstituted into LDL. The LDL can efficiently deliver this lipid to cells via the LDL receptor pathway. The NBD-cholesterol esters are hydrolyzed in the lysosomes, resulting in NBD-cholesterol that can now be transported back to the plasma membrane and efflux from the cell. The efflux can be monitored by the aforementioned FRET assay in which NBD transfers its fluorescence resonance energy to the rhodamine-phosphatidylethanoline acceptor.

[0374] Animal Model Systems

[0375] Compounds identified as having activity in any of the above-described assays are subsequently screened in any available animal model system, including, but not limited to, pigs, rabbits, and WHAM chickens. Test compounds are administered to these animals according to standard methods. Test compounds may also be tested in mice bearing mutations in the ABC1 gene. Additionally, compounds may be screened for their ability to enhance an interaction between ABC1 and any HDL particle constituent such as ApoAI, ApoAII, or ApoE.

[0376] The cholesterol Efflux Assay as a Drug Screen

[0377] The cholesterol efflux assay measures the ability of cells to transfer cholesterol to an extracellular acceptor molecule and is dependent on ABC1 function. In this procedure, cells are loaded with radiolabeled cholesterol by any of several biochemical pathways (Marcil et al., Arterioscler. Thromb. Vasc. Biol. 19:159-169, 1999). Cholesterol efflux is then measured after incubation for various times (typically 0 to 24 hours) in the presence of HDL3 or purified ApoAI. Cholesterol efflux is determined as the percentage of total cholesterol in the culture medium after various times of incubation. ABC1 expression levels and/or biological activity are associated with increased efflux while decreased levels of ABC1 are associated with decreased cholesterol efflux.

[0378] This assay can be readily adapted to the format used for drug screening, which may consist of a multi-well (e.g., 96-well) format. Modification of the assay to optimize it for drug screening would include scaling down and streamlining the procedure, modifying the labeling method, using a different cholesterol acceptor, altering the incubation time, and changing the method of calculating cholesterol efflux. In all these cases, the cholesterol efflux assay remains conceptually the same, though experimental modifications may be made. A transgenic mouse overexpressing ABC1 would be expected to have higher than normal HDL levels.

[0379] Knock-Out Mouse Model

[0380] An animal, such as a mouse, that has had one or both ABC1 alleles inactivated (e.g., by homologous recombination) is likely to have low HDL-C levels and higher than normal triglyceride levels, and thus is a preferred animal model for screening for compounds that raise HDL-C levels or lower triglyceride levels. Such an animal can be produced using standard techniques. In addition to the initial screening of test compounds, the animals having mutant ABC1 genes are useful for further testing of efficacy and safety of drugs or agents first identified using one of the other screening methods described herein. Cells taken from the animal and placed in culture can also be exposed to test compounds. HDL-C and triglyceride levels can be measured using standard techniques, such as those described herein.

[0381] WHAM Chickens: an Animal Model for Low HDL Cholesterol

[0382] Wisconsin Hypo-Alpha Mutant (WHAM) chickens arose by spontaneous mutation in a closed flock. Mutant chickens came to attention through their a Z-linked white shank and white beak phenotype referred to as ‘recessive white skin’ (McGibbon, 1981) and were subsequently found to have a profound deficiency of HDL (Poernama et al., 1990).

[0383] This chicken low HDL locus (Y) is Z-linked, or sex-linked. (In birds, females are ZW and males are ZZ). Genetic mapping placed the Y locus on the long arm of the Z chromosome (Bitgood, 1985), proximal to the ID locus (Bitgood, 1988). Examination of current public mapping data for the chicken genome mapping project, ChickMap (maintained by the Roslin Institute; www.ri.bbsrc.ac.uk/chickmap/ChickMapHomePage.html) showed that a region of synteny with human chromosome 9 lies on the long arm of the chicken Z chromosome (Zq) proximal to the ID locus. Evidence for this region of synteny is the location of the chicken aldolase B locus (ALDOB) within this region. The human ALDOB locus maps to chromosome 9q22.3 (The Genome Database, gdbwww.gdb.org/), not far from the location of human ABC1. This comparison of maps showed that the chicken Zq region near chicken ALDOB and the human 9q region near human ALDOB represent a region of synteny between human and chicken.

[0384] Since a low HDL locus maps to the 9q location in humans and to the Zq region in chickens, these low HDL loci are most probably located within the syntenic region. Thus we predicted that ABC1 is mutated in WHAM chickens. In support of this, we have previously identified an E_K mutation at a position that corresponds to amino acid 89 of human ABC1. This non-conservative substitution is at a position that is conserved among human, mouse, and chicken, indicating that it is in a region of the protein likely to be of functional importance.

[0385] Discovery of the WHAM mutation in the amino-terminal portion of the ABC1 protein also establishes the importance of the amino-terminal region. This region may be critical because of association with other proteins required to carry out cholesterol efflux or related tasks. It may be an important regulatory region (there is a phosphorylation site for casein kinase near the mutated residue), or it may help to dictate a precise topological relationship with cellular membranes (the N-terminal 60 amino acid region contains a putative membrane-spanning or membrane-associated segment).

[0386] The amino-terminal region of the protein (up to the first 6-TM region at approximately amino acid 639) is an ideal tool for screening factors that affect ABC1 activity. It can be expressed as a truncated protein in ABC1 wild-type cells in order to test for interference of the normal ABC1 function by the truncated protein. If the fragment acts in a dominant negative way, it could be used in immunoprecipitations to identify proteins that it may be competing away from the normal endogenous protein.

[0387] The C-terminus also lends itself to such experiments, as do the intracellular portions of the molecule, expressed as fragments or tagged or fusion proteins, in the absence of transmembrane regions.

[0388] Since it is possible that there are several genes in the human genome which affect cholesterol efflux, it is important to establish that any animal model to be used for a human genetic disease represents the homologous locus in that animal, and not a different locus with a similar function. The evidence above establishes that the chicken Y locus and the human chromosome 9 low HDL locus are homologous. WHAM chickens are therefore an important animal model for the identification and testing of drugs that modulate cholesterol efflux.

[0389] The WHAM chickens' HDL deficiency syndrome is not, however, known to be associated with an increased susceptibility to atherosclerosis in chickens. This may reflect the shorter lifespan or, more likely, the impaired absorption of dietary cholesterol in these chickens. We propose the WHAM chicken as a model for human low HDL for the development and testing of drugs to raise HDL in humans. Such a model could be employed in several forms, through the use of cells or other derivatives of these chickens, or by the use of the chickens themselves in tests of drug effectiveness, toxicity, and other drug development purposes.

[0390] Therapy

[0391] Compounds of the invention, including but not limited to, ABC1 polypeptides, ABC1 nucleic acids, other ABC transporters, LXR-modulating compounds, RXR-modulating compounds, and any therapeutic agent that modulates biological activity or expression of ABC1 identified using any of the methods disclosed herein, may be administered with a pharmaceutically-acceptable diluent, carrier, or excipient, in unit dosage form. Conventional pharmaceutical practice may be employed to provide suitable formulations or compositions to administer such compositions to patients. Any appropriate route of administration may be employed, for example, intravenous, perenteral, subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, intraventricular, intracapsular, intraspinal, intracisternal, intraperitoneal, intranasal, aerosol, or oral administration. Therapeutic formulations may be in the form of liquid solutions or suspension; for oral administration, formulations may be in the form of tablets or capsules; and for intranasal formulations, in the form of powders, nasal drops, or aerosols.

[0392] Methods well known in the art for making formulations are found in, for example, Remington: The Science and Practice of Pharmacy, (19th ed.) ed. A. R. Gennaro AR., 1995, Mack Publishing Company, Easton, Pa. Formulations for parenteral administration may, for example, contain excipients, sterile water, or saline, polyalkylene glycols such as polyethylene glycol, oils of vegetable origin, or hydrogenated napthalenes. Biocompatible, biodegradable lactide polymer, lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene copolymers may be used to control the release of the compounds. Other potentially useful parenteral delivery systems for agonists of the invention include ethylenevinyl acetate copolymer particles, osmotic pumps, implantable infusion systems, and liposomes. Formulations for inhalation may contain excipients, or example, lactose, or may be aqueous solutions containing, for example, polyoxyethylene-9-lauryl ether, glycocholate and deoxycholate, or may be oily solutions for administration in the form of nasal drops, or as a gel.

[0393] Compounds

[0394] In general, novel drugs for the treatment of aberrant lipid levels and/or CVD are identified from large libraries of both natural product or synthetic (or semi-synthetic) extracts or chemical libraries according to methods known in the art. Those skilled in the field or drug discovery and development will understand that the precise source of test extracts or compounds is not critical to the screening procedure(s) of the invention. Accordingly, virtually any number of chemical extracts or compounds can be screened using the exemplary methods described herein. Examples of such extracts or compounds include, but are not limited to, plant-, fungal-, prokaryotic- or animal-based extracts, fermentation broths, and synthetic compounds, as well as modification of existing compounds. Numerous methods are also available for generating random or directed synthesis (e.g., semi-synthesis or total synthesis) of any number of chemical compounds, including, but not limited to, saccharide-, lipid-, peptide-, and nucleic acid-based compounds. Synthetic compound libraries are commercially available from Brandon Associates (Merrimack, N.H.) and Aldrich Chemical (Milwaukee, Wis.). Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant, and animal extracts are commercially available from a number of sources, including Biotics (Sussex, UK), Xenova (Slough, UK), Harbor Branch Oceangraphics Institute (Ft. Pierce, Fla.), and PharmaMar, U.S.A. (Cambridge, Mass.). In addition, natural and synthetically produced libraries are produced, if desired, according to methods known in the art, e.g., by standard extraction and fractionation methods. Furthermore, if desired, any library or compound is readily modified using standard chemical, physical, or biochemical methods.

[0395] In addition, those skilled in the art of drug discovery and development readily understand that methods for dereplication (e.g., taxonomic dereplication, biological dereplication, and chemical dereplication, or any combination thereof) or the elimination of replicates or repeats of materials already known for their HDL-raising, triglyceride-lowing, or anti-CVD activities, or known for their ability to modulate ABC1 gene expression or ABC1 biological activity should be employed whenever possible.

[0396] When a crude extract is found to modulate ABC1 gene expression, ABC1 biological activity, or a combination thereof, further fractionation of the positive lead extract is necessary to isolate chemical constituent responsible for the observed effect. Thus, the goal of the extraction, fractionation, and purification process is the careful characterization and identification of a chemical entity within the crude extract having HDL-raising, triglyceride-lowering, or anti-CVD activities, ability to modulate ABC1 gene expression, or a combination thereof. The same in vivo and in vitro assays described herein for the detection of activities in mixtures of compounds can be used to purify the active component and to test derivatives thereof. Methods of fractionation and purification of such heterogeneous extracts are known in the art. If desired, compounds shown to be useful agents for the treatment of pathogenicity are chemically modified according to methods known in the art. Compounds identified as being of therapeutic value are subsequently analyzed using any standard animal model of diabetes or obesity known in the art.

[0397] It is understood that compounds that modulate activity of proteins that modulate ABC1 gene expression or activity are useful compounds for modulating HDL-C levels and triglyceride levels. Exemplary compounds are provided herein; others are known in the art.

[0398] Compounds that are structurally related to cholesterol, or that mimic ApoAI or a related apolipoprotein, and increase ABC1 biological activity are particularly useful compounds in the invention. Other compounds, known to act on the MDR protein, can also be used or derivatized and assayed for their ability to increase ABC1 biological activity. Exemplary MDR modulators are PSC833, bromocriptine, and cyclosporin A. Other examples of compounds that may be assayed for the ability to increase ABC1 biological activity include oxysterols and their derivatives.

[0399] Screening Patients Having Low HDL-C or High Triglyceride Levels

[0400] ABC1 expression, biological activity, and mutational analysis can each serve as a diagnostic tool for low HDL or higher than normal triglyceride levels; thus determination of the genetic subtyping of the ABC1 gene sequence can be used to subtype low HDL or higher than normal triglyceride individuals or families to determine whether the low HDL or higher than normal triglyceride phenotype is related to ABC1 function. This diagnostic process can lead to the tailoring of drug treatments according to patient genotype, including prediction of side-effects upon administration of HDL increasing or triglyceride lowering drugs (referred to herein as pharmacogenomics). Pharmacogenomics allows for the selection of agents (e.g., drugs) for therapeutic or prophylactic treatment of an individual based on the genotype of the individual (e.g., the genotype of the individual is examined to determine the ability of the individual to respond to a particular agent).

[0401] Agents, or modulators which have a stimulatory or inhibitory effect on ABC1 biological activity or gene expression can be administered to individuals to treat disorders (e.g., cardiovascular disease, low HDL cholesterol, or a higher than normal triglyceride level) associated with aberrant ABC1 activity. In conjunction with such treatment, the pharmacogenomics (i.e., the study of the relationship between an individual's genotype and that individual's response to a foreign compound or drug) of the individual may be considered. Differences in efficacy of therapeutics can lead to severe toxicity or therapeutic failure by altering the relation between dose and blood concentration of the pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a consideration of the individual's genotype. Such pharmacogenomics can further be used to determine appropriate dosages and therapeutic regimens. Accordingly, the activity of ABC1 protein, expression of ABC1 nucleic acid, or mutation content of ABC1 genes in an individual can be determined to thereby select appropriate agent(s) for therapeutic or prophylactic treatment of the individual.

[0402] Pharmacogenomics deals with clinically significant hereditary variations in the response to drugs due to altered drug disposition and abnormal action in affected persons (Eichelbaum, M., Clin. Exp. Pharmacol. Physiol., 23:983-985, 1996; Linder, M. W., Clin. Chem., 43:254-266, 1997). In general, two types of pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a single factor altering the way drugs act on the body (altered drug action) or genetic conditions transmitted as single factors altering the way the body acts on drugs (altered drug metabolism). Altered drug action may occur in a patient having a polymorphism (e.g., an single nucleotide polymorphism or SNP) in promoter, intronic, or exonic sequences of ABC1. Thus by determining the presence and prevalence of polymorphisms allow for prediction of a patient's response to a particular therapeutic agent. In particular, polymorphisms in the promoter region may be critical in determining the risk of HDL deficiency, higher than normal triglyceride level, and CVD.

[0403] In addition to the mutations in the ABC1 gene described herein, we have detected polymorphisms in the human ABC1 gene (FIG. 4). These polymorphisms are located in promoter, intronic, and exonic sequence of ABC1. Using standard methods, such as direct sequencing, PCR, SSCP, or any other polymorphism-detection system, one could easily ascertain whether these polymorphisms are present in a patient prior to the establishment of a drug treatment regimen for a patient having low HDL, a higher than normal triglyceride level, cardiovascular disease, or any other ABC1-mediated condition. It is possible that some these polymorphisms are, in fact, weak mutations. Individuals harboring such mutations may have an increased risk for cardiovascular disease; thus, these polymorphisms may also be useful in diagnostic assays.

[0404] Other Embodiments

[0405] All publications mentioned in this specification are herein incorporated by reference to the same extent as if each independent publication was specifically and individually indicated to be incorporated by reference.

[0406] While the invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications. This application is intended to cover any variations, uses, or adaptations following, in general, the principles of the invention and including such departures from the present disclosure within known or customary practice within the art to which the invention pertains and may be applied to the essential features hereinbefore set forth.

1 256 1 183999 DNA homo sapien misc_feature (1)...(183999) n = a, t, g, c or none; k = g or t; y = c or t; m = a or c; s = c or g; h = a, c or t; b = c, g or t; r = a or g 1 gtctataatg gcatgccaca gggctctaaa actttgcagt tttatcatta actcaaatga 60 aatgtataca tgccgctgac tcaacatttt gagagacaac aaatacaatg aatatcaaga 120 tacatatata tataatatgt atttcttttt gagatggagt ttcactgttg ttgtccaggc 180 tggagtacaa tagcacgatc ttggctcact gcaacctctg cctcccaggt tcaagcaatt 240 gtcctgcctc agcctcccaa gtagctagga ttacaggcat gtgccaccac acctggctaa 300 ttttgtattt tttaagtaga gatggggttt caccatgttg gtcaggctgg tctcgaactc 360 ctgacctcag gtgatccacc tgcctcagca tctcaaagtg ctgggattac aggtgtgagc 420 caccccaccc ggccatatat atatattttt gagatggagt cttactctgt cacccaggct 480 ggagtgcaat ggcttgatct cggctcattg caacctctgc ctcccatgtt cagatgattc 540 tcctgcctca gcctctcaag aagctgggat tacaggtgca tgccaccatg cccaactaat 600 ttttatattt catcatgggg tttcaccatg ttggccaggg tggtgtcgaa ctcctgacct 660 caagtgatct gcctgccttc ggcctcccaa agtgctggga ttaccggcat gaaccaatac 720 gcttggcaat atttttttaa gaaaaaaaaa tttcaggttg caacagcatc caaaaagtaa 780 ccaatgattt taggtgaagg gtgaagacaa atgtaaactc tttttttttt ttttgaaatg 840 gcgtcttgct ctgtcgccta ggctggagtg cattggtgca attgcgactc actgcaacct 900 ccacctcctg gactcaaacg attctcctgc ctcagcctcc cgagtagctg ggattatagg 960 ctcgcgccgc cacgccccgc taatttttgt gtttttagta gagacggggt ttcaccatgt 1020 tggccaggct ggtctcaaac tcttgacttc aagtgatccg cctgccttgg cctcccaaag 1080 tgctgggatt acaggcttga gccacccggt gaaatgtaaa ttgttaaacc tgtgtttttg 1140 aaaatgcata agtataggat aagggagaat tgactttctg aagaccagaa cattttagtc 1200 aatttcaaac acaatgtgag tcaattgtat aaaacaggtt ccttatcctg atgaggataa 1260 gaatagtatc cttgtcagat ggaaatgccc attcagctgt actttctagt ggttacgccc 1320 atagtagcac tgttgatgga accaggtatc tgactttagg aaagatgttc cccaactgga 1380 gctgacccag aggagcctga ccaacttggg gaaagtttaa agatctcatc acgtggagaa 1440 taggggaagg caccaacacg tattgagtgt ctactttgag cttaagggag aaggagaaaa 1500 ggcagggaat aaacggagga tggaataaga ataggtaatc ttccttaggt ttaaataata 1560 agtgcttgcc ataggaagga gccccagaac acagttatca ataatagaga ctcacacaga 1620 gcattctaca ctagagctgc tgtcctcttg accagaataa gggtaaggtg tgtgtgcgtg 1680 tccaggaaag taggcagcta ggaggtgatc agagcataca ctactgccgc cacaattcta 1740 agtgtcttcc cctaggggaa tcctatttct tctcaggcac atttgtttat tcattccatg 1800 ttcactcttg ttatttactt cttgccaggc tttgtgttaa gaattgggga aacaaggttg 1860 aataaaccca gtctgtaaag aaaaggagct cacagtctgg aggggcaaat gggcattgtg 1920 cctgcaagtt ggcccactga gagcctaaga agtgaagtta tgaatccagg attactcagt 1980 tatcaatgaa gtgattaaac atcatccata cagaccttca gagctggagg gaattttgga 2040 tacctactca gcacatagtt ttcaaacagt gccttgtgga accctagggc atttcttagg 2100 gattgctgtg tgtgagagag gagattgaat cagaaggtgt ctgggaccat tctctactca 2160 cacttcaagc agagcagctc cacttctatc tgtattatta tttttattat ttatttattn 2220 tatttatttt gagacggagt ctcgccctgt cgcccaggct ggagtgcagt ggcacgatct 2280 cagctcactg caacctctgc ctcccgggtt caagagattc tcctgcctca gcctcctgag 2340 tagctggaat tataggccta tgccaccacg cccagctaat ttttgtattt ttagtagaga 2400 cggggtttca ccatgttggt caggctggtc tcgaactcct gaccttgtga tccaccggcc 2460 ttcgcctccc aaagtgctgg gattacaggc attagccacc atgccccgcc tatctgtatt 2520 atttattcat tattgctatg tgaatgaacc tgaagaatgc ttactgttac tgctaagtat 2580 ttaaccacac cccatgccca tgcaggatga tagtgaatag tggccaaaag atactataat 2640 tagactcatg taattaagga atatttttgt cttgtaccta ttatgtgcct ataaagacta 2700 tgaaatctat ttattcagtg atttattgga ataccaaata agcaaagatc ctatgtgcta 2760 aagattctaa tattgtgcta agattttcct tcagatgttt ggctttctca aattccctga 2820 gggctagaac tttgccctac tcatttgtgt ttcccaagtg tctaacgcag tgcctgacac 2880 atacaggatc tccaaacgct tgctgaatgt gtgaggaagg aattaaaata atgtaccgcc 2940 gggcaaagtg gctcatgcct ataatcccag cactttgaga gaccgaggtg ggcagatcac 3000 ttgaggtcag gagctcgagc ccagcctggc caacatggtg aaaccccgtc tctactaaga 3060 atacaaaaat tagccgcgag ttgtggcagg cgcctgtaat cccagctact cggaaggctg 3120 aagtgggaga atcacttgaa cctgggaagc agaggtcgta gtgagccgag atcagaccgc 3180 tgagctccag cctgggtgac agagcaagac tccatctcga aaaataaaag taaaataaaa 3240 taatgtacta actggaccca gaacagattt tccaattgat tattgacaac aaaggaatct 3300 gaattattta ataaggtgaa taagtacata ttcatatata tatgtatatg tgtgtgtatt 3360 tacattttta taaaagtgta aaagtatata tacttttttt cctttcttca ggtagaaacc 3420 tctcctagat tgtcactgaa taaacattag cactaactat ggcaatcaaa tcacatattg 3480 attggtgtca gagaagaatt gaacattcaa ctctgaagca gtgttatttc ttccatctgc 3540 aaacactctg tcccccatcc httctttgtg tatcctggaa tccaagtcat aaataatgat 3600 aggtatttct gtcaaaggta tttctcagag gagtgatatg taactccctt tcctctgata 3660 cactgactca ctaagcagct acccttgtga attccaatta gcaatacttc ttgctatgtc 3720 tggtccaact ttcagacaaa cctagtgttc aggattccta tagccattta taggtgtaga 3780 caggaagcat tcagatatcc ccagaggtac ctgataacca gctgatccat gactgtctgt 3840 cttgggcttg gccagcttga aatcttgaca ttgtggttct cncccagaga aggtgccttt 3900 tggatgtgag ataaagacat tatgactaga tagtgcatgg tgagggtgtt tttctagttt 3960 taccgaagtg ttgatctgta aagctgctac cagcacacac nnanncacac acacacacac 4020 acacacacac acaattaacc acagatatcc tcatgggaaa ttgtcttagg aaagaatgga 4080 agccaagatt ttatatatag accacagaag gtgatgggta atgttcttgg aagggagttg 4140 acaggcaata gctataagtt aactcaggaa agcaaagaaa atccccaaag agctaaggga 4200 gaggttagag attctgcttt ttattagcaa ttcatagctc tcaagtttca tacagtcttt 4260 aaggctcccc tcttcatata gaataaatga aattatttta taaattgttc ctcagattcg 4320 tatctgtaca ttctgggacc acgagttgta gcaggatgtg attttcctca ttctgggcat 4380 ctaagttcta cagttaagga cactgaaaca aacctttagt cgaataaaga ttggcacatt 4440 gtttcttctc ccataaacat tgaatggtcc aggaagggcc aggtgtggtg gctcacacct 4500 gtaatcccag cactttggga ggccaaggca ggtagatcac ccgaggtcag gagttcaaga 4560 ccagcctggc caagatggtg aaaccctgtc tctactaaaa atacaaaaag gaggagtggt 4620 agtcccagct tttgcatgaa tgtccacatt gggactttat cttaaatgtg caagacacaa 4680 ctcttggcca ggcgtggtga accgagattg tgccactgca ctccagccag agaatgcact 4740 ccagcctgag caacagagcc agactccttc tcagaaaaaa aaaaaaaaaa aaaaagacaa 4800 ctcttaaatt ggtgttctac tgtccttgaa ccaccatacc tttgcagaag tcattctaga 4860 tcatcaatct gaccataaaa cacagtttgc caaactggca actaactgca cctatttgtg 4920 attagtttgg aagaaacttg aaagtgcccc ttttaactgt ccacttattc cctgcctggg 4980 cacggaattc tctcccacgc ttaataacgg acacttttaa aaattatttt tactccccac 5040 agagttggca gattgctgtt tcagagagtt aaatggaatg cctccagtgg aagtatcccg 5100 ttttctagaa tggaaagtct atcttcacag tgtcataatc caggtgccct gggctgagac 5160 ttcccctgcc taggcggtac cctgggtagc acagctgaac tggctgtgaa ctaaacattc 5220 attttttatt agcagccagg ctggacagag atcacacaag accaacctga caacagcagc 5280 atgctgctct gcttcagaag taattttttt tttttttgag acaagagcct tgctctatca 5340 cccaggcaaa gtgcagtggc atgatctcgg ctcactgcaa cctccacctt ctgggatcaa 5400 gcgattctcc tgcctcagcc tcccaagtag ctgggattac aggcgcatgc caccatgcct 5460 ggctaatttt ttgkactttt gtattttttt tttttttgag atggagtcct cgctctgtcg 5520 cccgggctgg agtggcctga gccaccgcac ctggcccttc agaagtaatt tttaaggaag 5580 gatcttgtcc tgggtggggg gttgggcaag gctgtgaaaa gcaaacaaaa atcacatgtg 5640 gcctaatagg aggccagtgg aaatacacat gatgaaaaag aaacttacaa aagcacatta 5700 ttaatttctg aacatgctaa taccatccaa taacaataag atctaatatt tattgaatgc 5760 tgattactca tcagctacat tccaagtact tctacatgta cagttatgtg acacataaga 5820 tgtttaggtc accactgaca gaatgtatga tggtgacccc ataagattac aaaaattcct 5880 attgcctagt gacatatgta gcaagatgca ttactcacgt gtaataagta atgtgatgct 5940 gctgtaaaca aacctactgt gctgccatgc ctatagaagt ctaggacggg ctgggcacag 6000 tggctcatgc ctgtaatccc agtacggtgg gaggccaagg caggccgatt gcttgagccc 6060 agaagttcaa gaccagccgg ggcaacatag tgagagcctg tctctacaaa aaatacaaga 6120 acaaattagc caggcatggt ggcgcatgcc tttggtccca gctactcagg aggctgaggc 6180 aggaggatca cttaagccca ggaggttgag tctgcagtga gccatgatgg tgccactgca 6240 ctccaacctg gagagagagt gagacccggt tttaaaaaaa aaaaaaagcc taacaaataa 6300 aagtatgtat agtacagaat acttggtaat gataaacagt atgttactgg tttatgtgtt 6360 tactatactt tttatttttt agagtatact cctacttatt taaaaagaaa aaagaaagtt 6420 aactgtaaaa caggcttaga caggtccttc aggaggtatt ccagaagaaa ggcattgtta 6480 tcacaggaga tgaaagctcc atgcatgtta ctgcccttga agaccttcca gtgggacaag 6540 atgtgaggtg gaagtctatg atagtgatga tcctgaccct gtgtaggcct aggctaatat 6600 gtgtgttttt aacatattag tttttaacaa caaagtttaa tcagttaaaa caattttttt 6660 aatagaaaaa agctcataga ataaggatat naaagaaang acaatgtttt gtttgtttgt 6720 ttgtttgttt tttgagacag agtctcgctn ccatcacctg ggctggagtg cagtggcaca 6780 atcttggctc agtgcaacct ctgattccca ggttcaagta attctcatgc ctcaacctcc 6840 caagtagctg ggattacagg cgcccatcac cacacccaac taacttttgt ctttttagta 6900 gagatggggt ttcatcatat tggccaggct tgtcttgaac tcctgacctc aggtgatcca 6960 ccctcctcgg cctcccaaag tgctgggatt acggttgtaa gccatcgtgc ccggccagaa 7020 agaaaatgtt tgtacagtat gtacgatgta tttgttttaa gctaagtgtt attatgaatg 7080 aatcatgtgt ggtgtagtgg ctcatgcccg taatcgcatc actttgggag accgaggacg 7140 gaggatagct tgagtccaga gttctagacc agcctgaaca acgtggtgag acctcgtctc 7200 tacgaaaatc atcaaaaatt atccatgcgt gggggtacat acctatagtc ctagctactc 7260 aggagactga ggtgagggga ttgcttgaac tcaagaagtt gaagctgcag tgagctataa 7320 ttgtaccact gcatttcggc ctaggtgacc ccgcctcaaa gaataataat aaaagctggg 7380 cacggtggct cacgcctgta atcccagcac tttgggaggc caaggcgggc ggatcacttg 7440 aggtcaggag tttgagacca gcctggctga catggtgaga ccccgtcgct actaaataca 7500 aaaaattagc cggtcgtggt ggtgtgtgcc tgtaatccca gctactcagg aagctgaggt 7560 gggagaattg cttgaacctg ggaggcagag gttgcagtga gccgagatag cgccactgca 7620 ctccagcctg agcaaaaaac aataataaat aaataagttt gaaaatttaa aatgtttata 7680 aaattaaaaa gttacagtga gctaagattt agtattaaag aattttttta taaacatgta 7740 gtgtaactct acagtgttaa taaactctac agtagtgtac agtgacatcc taggccttca 7800 cattcactca tcactcactg actcacccag agcaacttcc agtcctgaaa gctccattca 7860 tggtaagttt cctatacagg cgtgccattt aaaaaatctt ttaggccagg cgcggcggct 7920 cacgnctgta atcccagcac tttgggaggc tgaggcaggc ggatcacgag ctcaggagat 7980 cgagaccatc ctggctaaca cggtgaaacc ccgtctctac taaaaataca aaagattagc 8040 tgggtgtggt ggtgggcgcc tgtagtccca gctactcggg aggctgaggc aggagaatgg 8100 catgaaccca ggaggcggag cttgacagtg agctgagatc acaccactgc gctccagcct 8160 gggccacaga gcaagactcc atctcaaaaa aaaaaaaaat cttttatact atattcttaa 8220 tgtacctttt ctatgtttag atacacaaat accactgtat tacaattgcc tactgtattc 8280 agtacagtaa catgctatat ggttacgtag cctaggaaca ataggctata ttgttcctag 8340 gtatagggat gtggtatcct ataccatcta ggtttgtgta agtttatttt atgatgtttg 8400 catgacgaca gagtcaccta aggactcatt tcttagaata tatttgtagt taagcaatgc 8460 atgactctat tgactcatga attcttacca cagacctatg gggcagtact attgttaccc 8520 tcattttata aatgataaaa ctgaggtaca gagacagtaa ataacttgac cacggtcatt 8580 cagctactca agagtcaagg ctgggattta aaaccagatc acatggtttc agagtgttca 8640 cacttaccta ctatactgtc tcaagagcaa ggatgttttg gttcacttga caaatgaaga 8700 tagggacctc tttcattata agcctatttt aggctaaaat agaagggaag gggacacagt 8760 gaatccaggc cttctggcat ggctcctcag ccctttctga gctggcctgg gacagccttc 8820 ctacctcact gatgccactt cctactgagc gactttcctg cctacctcac tgatgccact 8880 tcctactgag cgactttcct gggctccaga cccagtaagc gactttgcct gcacccacct 8940 tattttgctc tactccctgt gcttttatgc ctttacccat ctgccctgga aagctcttct 9000 aacctttgaa tgggtaaagg cataaatgta tgctggagaa atcctcagct cagggccagg 9060 cacgctggct cacgcctgta atcccagcac tttgggaggc cgaggtgggt agatcacctg 9120 aggtcgggag ttggagacca gcctgaccaa catggagaaa cccctgtctc tactaaaaat 9180 acaaaattag ctgggcgtga tggcacatgc ctgtaatccc acagctactc gggaggctga 9240 ggcaggagaa ttgcttgaac ctgggaggcg gaggttgcag tgagccaaga ttgcgcctct 9300 gcactccagc ctgggcgaca gagctagact ctgcctcaaa aaaaaaaaac aaaaaaaaga 9360 aaaaaaagaa aaaaaagaaa tcctcagctg agttgtcaac tcctctttga aactttctca 9420 gacctttcag gctgagtcgt cgctcatttg tgcttcctca gttcctggct tctaccttct 9480 tcatagcttg tttcatgtaa tgttaattct tacttgcttt ctccctcttc taagctgaga 9540 gctacttcaa agcatgggta ggacctagca cggtgtatgg gacatgggtg gtaccccgta 9600 aatgtttact gaaaaaaaaa tgcctaaagc aattgttaac atcatcagat agataattat 9660 gggcattcag agattctgtc ttcaagctta tataaagaac ttatttttgg ctctaattat 9720 cctgataatt ttctcattac tttcacttat tgtggcttgt ggatcaattg ttgacatttt 9780 ataaacattt cactatttga caatgatgat actaaaatac gaattaagca accattctaa 9840 agatagtgat gatgataaca tatacgctgg taacatcttt attttcagcc gtatcatgga 9900 atcctctgtt tccattctgc taggtaggca ggtatgcagg tagaacttgt gagaggatat 9960 gatttttgtt tccatcttag atatgacagg aacttggaat ttttgacata aatgacgaac 10020 atccgggatt cttaaacaat ctttaaaaat ggaatgcctt aaaagctggg cgcagtggct 10080 cacgcttata atcccagcac tttgggaggc tgaggcaaat ggatcacttg agttcaggag 10140 ttcaagacca gcctggccaa catggtgaaa ccccatctct actacaaata caaatattag 10200 ccgggcgtag tggcaggcgc ctgtaatccc agctacttgg gagcctgagg caggggaatt 10260 gcttgaaccc aggaggcctt ggagattgca gtgagctgag actgcgccat tgcactccag 10320 cctgggcaac aagagtgaaa ctccatctcc ggaaaaaaaa aaaaaaaaaa aaggaatgcc 10380 tttgggaata atttatttat aatttatgta taacatatag acaaaccatt agtttgtctt 10440 atattttact aaatataaat ttagtaaata taaatattta ctaaatataa aaactcttag 10500 attttactaa agagttacaa ctaattggcc tggcgtggtg gctcacacct gtaatcccag 10560 cactttagga ggcagaggtg ggccgatcac gaggtcagga gatcaagacc atcctggcta 10620 acacggtgaa actctgtctc tactaaaaaa aaaaaaatac aaaaaattag ccgggcgtgg 10680 tggcaggccc ctgtagtccc agctactcag gaggctgagg caggagaatg gcgtgaacta 10740 agcagaggct tccctaaaag tgatcttcag gataaaggca gaggaagagg ctccatgact 10800 gggattggtg ttgaggagag ccagagaagc aagctacaga aaagagaaaa aattaatatg 10860 caagagagta aacaacacga aggaaaagaa cccagtgtgg aaacactaca cgtgagaaag 10920 gtgtctgtaa ggatgttcta caaagcaaat gcttggatat taattcattg cagcaggaga 10980 tggtaagcct catgataaag aaggagaaaa aatcaagtca agggctctga ggtactgacc 11040 caggtatact tgactatgcc agcaactgtt tagggggaga tttgagctac acttgtagca 11100 aaggcaaaat ctgtaattag ttgtaactct tttttttttt gagatggtgt ctcgctctgt 11160 cccccaggct ggagtgcagt ggtgtgatct tggctcactg caagctccgc ctcctgggtt 11220 caagtgattc tccagcctca gcctcccaag taattgggac tacaggcatg caccaccatg 11280 cccagctaat ttttgtactt ttattacaga ccatgttttg ccatgttcac caagctggtc 11340 tcaagctcct gacctcaagt gatccgtccg cctcggccnt cccaaagcgc tgagattata 11400 ggcctgaacc accgcgcctg gcctaaagag atctaattct tagcaaagtt tcaccaggga 11460 gtctctcctc acccccaccc catccttccc acaaagaatt agaacaatgt ccctactacc 11520 cctgctgtat ctctgacttt ttactttaaa tctcagcaga atattttact aaatgttttg 11580 atgtggttat ataaaatcat ccctgctgac aaggaaacac tttttgaaaa aagttttcat 11640 tatcaaacag taagtacagc tgactgccgt gacctttaac ccatttctga gtctcccctc 11700 attggacttg ggtggagggg actggtacca ataaagtcaa atgcttaata atttatgcaa 11760 gtgcttgaag aaatttgaag ttgaatattt ctatcatctt gaaatggaga aagaatctgt 11820 aaacagcaaa gccagacgcc ctaaaggaaa agatttacag attaaaataa gattgcaatc 11880 tggtaaaaat atttgcaaca catgtaacag tcagaaagtt gaaacacttg gtttaacawg 11940 agcttttawc agataaataa ggaaagaata aacattggat tttaaacacc gataaacatg 12000 aaaagatgtt taatctttat ttttatttaa tccatattat ttttcagttt aatcaaagaa 12060 aatataaatc aaaacaataa tacattttat atatatatat atatatatat atatatatat 12120 atatatatag taggcataag ttaaagactg ataagactgt tgaaaaggga tgaaaaacta 12180 ggcttactca taccaatata tatctattag gatggctaaa gtaaaaaata ctgaaaatat 12240 caagtgctca aaaggatatg gagcaattgg aaccctcaga catcgctgrt ngagaaaaca 12300 aaatggtaca gccaccctgg agaacagttt agctgtttct tgtaaagtta aacatgcgct 12360 taccatatga ctcagcaatc tcactcctgg gtatttatgc taggaaaagg aaaatttata 12420 cttgcacaca aaaaacttgt aagtgaatct ttatagcagc tctattcata actgccaaaa 12480 actgagagaa aatgtccttt aatgtgtgaa tggataaacc aactgtgcaa catccatgta 12540 atgaaatact acttagcaat aataataata atattaaaaa ccccagaacc attgatgcat 12600 gcaacaaata tggataaatc tcaaaagcat tatgctgagt aaaagaagtc agtctgaagg 12660 atttcatact ctaggattcc atttatataa cattattgaa acgacaaaat tatggggaca 12720 gagaatagat cagcggttgc caggggttta ggtgtgtgga gagggtgtgg ctataaagaa 12780 catgcaaggg aatttttttg ggagatgaaa tggatctgta tcctaattat ggtcatggta 12840 acacaaatct atacatgtgt ttagattcat agaactgtat accaaaagaa aaaaagtcat 12900 ttttactctc ttaaaatgaa aaaagaaaaa gcctgggcat tctaacacct tgtttgtgag 12960 agtacacatt gataccaagt tttatggtgg gcaattttgc tataaatacg gaaagtttgt 13020 ctgttctatt attcagcaat cccagttttg caaaactatg ctaaagaatc tttgggggcc 13080 gggcacggtg gttcacgcct gtaatctcag cactttggga ggccgaggtg gacgatcacc 13140 tgaggtcagg agtttgagac cagcctggcc aacattgggc aaccctgtct ctactgaaaa 13200 tacaaaaatt agccgggcat ggtggcgcat gcctgtagtc cgagctactc gggaggctga 13260 ggcaggagaa tcacttgaac ctgggaggca gaggttgtag tgaactgaga tcgtgccacc 13320 gcactccagc ctgggcaaca gagtgagaat ccgtctcaaa aaaaaaaaaa aaaaaaaaaa 13380 caaaacaaaa caaaaaactt tgtgtacgtg tgcaaagaga atacaaagat gatcatggct 13440 gcatttttta aatgactata aaaaagaggt acaaccagcc aggtaaagtg gtgtgcacct 13500 gtagtcccag ctactcggga gggtgaggtg agaggaacac ttgagtccag gagtttcagg 13560 ccagcctggg caacatagtg agacccctgt cccaaaaaca aaacaaaaca ccaaaatgtc 13620 tatctgtagg aatttgtttt caagttgtga tacataggta cagtgaaata ttatacattc 13680 atttaaaatg atgataaaat ctgtatttgt ttacatgaaa aactgtccac tataatgtag 13740 tgaaaataat agattacaaa caatatatat ggaataaact tgtttagaaa caatttctag 13800 aagaaggtac aatggacaga attatctctg ggaagtgggt ttataatgat tctcattttc 13860 ttctttgtat ctttttcata gtctttctac ttttgttatg tctggacatt tgattatgag 13920 catgtattac tgatctattt taaaaattga ttttaatttt tacaaaaact catgtaaagc 13980 ttgaaggttc gcattttaga ccatgttaaa attttcctgg atcaaaacag acttattcaa 14040 atatcttgta ccctgtcctc caaaattgcc tgccaaaata cactacaaaa gagagcattt 14100 agctgcatat tttttggact gctgagatca acaatattat ttaccatggc ttaaattttt 14160 accctccagt atgtgtgggt tacaaacact cttccacatt tttgaggcat tgcttttgat 14220 attttaaatg taaattcagc tgtgcgcggt ggctaacgcc tataatccca gcactttggg 14280 acgctgagga aaggatcact tgaggtcagg agtttgagac cagtttagct aacgtggtga 14340 aaccccgtct cttttaaaac tacaaaaatt aaccgggcat ggtggcaggc acctgtaatc 14400 ccagctactc aggaggctga ggcaggagaa tcacttgaac ttgggagaca gaggttgcag 14460 tgagccgaga tcatgccact gccctccagc ctggccacag agcgacactc catctcaaaa 14520 aaaaaaaaaa aaaaaaaaaa aaaggccagg cgcagtggct cacgcctgta atcccagcac 14580 tttkggagcg ccaaggtggs sggatcaact gaggttggga gttcacgacc agcctgacca 14640 acatgcagaa acccygtctc tactaaaaat acaaaattag ccgggtgtgg tggtacatgc 14700 ctgtaatccc agctactcgg gaggctgagg caggagaatt gcttgaaccc aggaggtgaa 14760 ggttgtgttg agctgagatc ccgccattgc actccagcct gggcaacgag caaaactctg 14820 tctcaaaaaa accgaaaaaa ttcccccaaa aaaccaaaaa aaaacagcaa caacaaaaaa 14880 atcaaataat gtaccttgtt tagcataaag cataattata tgcatatggt gattgggagg 14940 atgaaatgga aaggttattt attactgact tcagaaatta tgtcctgata gattgatggt 15000 tgatttaaat ataacttctt gtcaagcatc tgtttttaga atcaaattac tatgactctg 15060 cagtttcctt gaaatctcat agtatcacat ctctgtttgc ctttgcatgg ttttaagaaa 15120 atgaggagtg tgaaacttca aacttcgttt tcattgtatt acatttttga atgacacact 15180 ggtcatttcc tagaaatata aggtgacaaa tatttcacag aaacataagg tgctattatc 15240 tcattcaatc ttaggtcact caaaactctt tctctcccac acattgaaga ttcatttggg 15300 aatgcttttg tcttattgtg cacccccagt gaaggggtgg taagtgtttt tcattttgct 15360 tcctttgttt atctacaggg ttccattcaa taaacaaagg gacttgggtc aaacttcagg 15420 ctcttatggg tttggatgta atctttggtc tcatttttag ttaccaacag agagtgttgc 15480 ttctgacctc tttgactctt ccctgctgaa tttactatgc ctttgatact tgtgaagggt 15540 gagattttcg aggagtactg ttgtttttgt tagaggttgt aatgtctttc ttcgctttgt 15600 gattcaagtt gtgttcagtt acaatcataa gcatgtgcct aaaaaaatca gatgcaaact 15660 agcaaaagta gaaactcagg gtgacagtct ttaagaaaag atgcaattct tggggctggg 15720 tgcggtggct tatgcctata atcccagcat gttgggaggc caaggtgggc agatcgcccg 15780 aggtcaggag ttcgagacca gcctggccaa tgtggtgaaa ccctgtctct actaaaaata 15840 caaaaattag ctgggcgtgc tggtgggtgc ctgtaatccc agctactcag gaggctgagg 15900 caggagaatt gctggaacct gggaggtgga ggttgcagtg agcctagatt gcgccattgc 15960 actccagcct gggcaacaag agcgaaactc tgtctcaaaa aaaaaaaaaa aaaaaaagat 16020 gcaactctta ttactgacac agaaatgaaa atttagttac atagtattgt aaaaggacta 16080 tcagctaggt ttagccttac caagatttag gtaattcatt tcctgctaca ctcatattct 16140 cagccacttc cctcatcaca ttttcagggt gcagtatata atagcgtcaa ctcgtgtaat 16200 tccccctact ccccatgaac ttctaggcca aggggccaca cggggtgggg catatagtat 16260 aaaggagtaa ggcagactgt tggagaaaaa cagggattag gccaggtgcg gtggctcacg 16320 tgtgtaatcc cagcacttta ggaggctgag gcgggcggat cacgaggtca agagtttgag 16380 accaggctga ccaacatggt gaaaccctgt ctgtattaaa aatacaaaaa ttagctgggc 16440 acggtggcag gcgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga 16500 acctaggagg cggcggttgc agtgagccaa gatcacacca ctgcactcca gcctgggtga 16560 cagagtgaga ctctgtgtca aaaacaaaaa taaatacaaa aacaaaaaca aaaacaaaac 16620 aaaacaaaaa acaaggatta gtgaggactt tgcaaagtgc aaacagcatg ctccatctaa 16680 agagagcttt tcagtactag ctgattgttg ttgtgtggaa atacgtcttg tatggtgaga 16740 tcttccacct tttcaacaga aatgagaaac acaaacattt ctgtgtgaca cttccaaact 16800 tgttggctat ttctgttttt ctttttttct ttctttcttt cttttttttt tttgaggcag 16860 agttttgctc tcgttgccca ggctggagtg caatgacgtg atcttggctc gccgcaacct 16920 ctgcctcctg gattcaagcg attctcctgc ctcagcctcc caagtagctg agacatgtgc 16980 cgccatgcct gactcatttt gtatttttag tagagacggg gtttctccat gttggtcagg 17040 ctggtctcga actcctgacc tcaggtgatc catctgccta ggcctctcaa agtgcaggga 17100 ttatgggtgt gagccactgc gcccggccct agttggttat ttcaaacaaa atactaacac 17160 tcttcagccc aaaccatgtg tcagagagct gggtgtggcc agtgggcccc tggtgtgcaa 17220 ctagtcttca gtgatgctac tgtatgctag tgacagcaca actgaccact gaaaatatgg 17280 tgcacgatgg tgaacagatt ggatttcccg tatctctaga accccagggc tttganggca 17340 attctgatag atggaggggg agcgaganga gagaaagagg agggagaggt atcacatgaa 17400 agagatactg gactttctgc taataaatgg gtgggtattt gggtgagtag ataggaaatt 17460 aaaaagaaat atgagagatg ggcgtggtgg cttgtgcctg taatcccagt gacttgggag 17520 gctgaggtgg gagtattgtt tgagaccagg agtttgagat cagtctgggc agcatagtga 17580 gaccccgtcc ctaaacataa aattttaaaa aatcagncta agcgtggtgg tgcgtgcctg 17640 taatcccagc tacttaggag gctgaggcag gaggatccct tgagcccagg agttagangg 17700 ctgcagtgag ctatgattgt gccattttac actagcctgg gtgagaaagc aagactccca 17760 catctgagac agagagagag gagagaggca gaccatattt gaactccagg ctggtgaata 17820 attcatactg attaatttaa ttgaaaatat ttatttaaaa tttgcagatc ggagttttgg 17880 aaaggttaca tttattaaat tattgttgga ctatgaggaa cacattgttg cttggaataa 17940 tatttaatta tgatgatgat gatgatgata ataaatgaca ataataaaca actatcattt 18000 attgagcaca taccatatac caggcccagt accaagcagt tcacatatat tatacatcta 18060 atacccagag caactctata aggcagctct atatgttgct tccatttgaa agatgaataa 18120 actgaggtac agagatgtga agtaacatgy tgatggacac ccagatatka tytkcagata 18180 agaaggataa gtcagaagat cagataagca gaaaggtttm macacacagg actytcattt 18240 agtagtaaac tyttatgatt ggtctaaaat tctattttct tttacagttt acctactcat 18300 caaatctgtc ttaactattt ttgtctctcc aaattctctt ctgttctttk ktactatgta 18360 cacatgcttt taaaatatgt taggatttgc tgtttccatg tgktaatttc ttgtttgcct 18420 ttcttcattt ataaactctt tcctcctact aattattgac aaactctatt tttctttttt 18480 tttttgaaaa actctatttt tcatatcatt tttctaaatt taaaaatagc atataaggct 18540 gggcatggtg gctcacacct gtaatcccag cactttggga ggctgaggcg ggtggatcac 18600 ctgaggttgg gagtttgaga ccagcctggc caacatggtg aaaccctgtc tcaactaaaa 18660 ttacaaaaaa aaaaagttag ccgggcatgg tgatgggcac ctgtagtccc agctactcag 18720 gagattgagg caggagaatc gcttgaaccc gggaggtgga ggttgcagtg agccgagatg 18780 gggccactgc attccaacct gggcaacaga gtgagaccct gtctcaaaaa ataaagtata 18840 tatatatata tatatataaa tatttacgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgta 18900 tacatatata tagcatgtgt ttattttata aattttgtaa aatacagaca tatatgaaaa 18960 aattacctat aatcttacca tccacaggtg attgctgtta gcacctatct gggatttttt 19020 ttttcttttt tctttttctt tttttttttt tttgcagtct tttctaggca catatattca 19080 atgaaaaagg ttatcaatat agtaacagtt attttttatt gaatgttaac catacacata 19140 aatatagctt tgcttttttt ttcaattcgc atttaccatg agcagtttct ctatatcaca 19200 actctttgaa aacattattt tcacctttca tggctatata acattctatc atgtgaattt 19260 atcaatgtgt attttagaat tgttatcasa tgattaagca tttttaaaat gttcactcat 19320 caataaaata gtgacaacat tcttgaacat aagttttttt taccatacta tcggatattt 19380 ccttagtata aattcctcca agtaaaatta gtgggccaaa ggcttttttt ttttgaggcg 19440 gagtttcgct ctcgttgccc aggctggagt gcaatggtgt gatctcggtt caccacaacc 19500 tcctcctccc aggttcaagc aattctcctg cctcagcctc ctgagtagct gggattacag 19560 gcatgcacta ccatgcctgg ctaattttgt agttttagta gagacagggt ttctccatgt 19620 tggtcaggct ggtctcaaac tcctgacctc aggtgatctg catgcctcgg cctcccaaag 19680 tgctgggatt acaggcatga gccactggtc ctggaccaaa gcctcttgat acattttgct 19740 aaactatcag aatgttgggc ctctttgcac cctactaaca ccctcagttt ttatgagtat 19800 gattacatta ttttccctct tttccccaac ctcctattct attttcctta gataccttaa 19860 gactgcaccc ccaacttagc tttctgttct cattcaggca gtgtgtccag ggcttccaat 19920 cccagtggtg agtaggcaga tgtgggggct tcctgttgaa ctctgttagg taattgagcc 19980 actcaagggg aaaccagctt tggggtccgc tctgtggcaa gcccaggact ctatngaatg 20040 gagctcattc ctcactaatt tacacaccag gacaagtagc cangctaaga aattgctcta 20100 atttcactgg tcgtcagtca acaaaagaat gaagatacag aaggaaaata aaaggcctgc 20160 tcccacagtg gggtcatagc acatctccac atttaacctt tgacaaggca aaagtatttc 20220 ttctgtccag acaattccag aagtcataag gaatactggg caaggatggg tcaaaacaac 20280 acagactatt gagggcaaca aaggaaaaac catgcactcc tgaagaaata gatcccaaaa 20340 tgctgaacga gaaagaggat tgtctctcta gaagtctaat agaaaacagt atctggaaca 20400 ttaaacgtag gatttttttt ttttctaatg acttactctc ttattaatat gtcagagaaa 20460 agaatagctc ctggctaaga aatacaacag tccctcatcc cagaaatcac agccaggaat 20520 atggattctt aagtgtcaaa aaagtactct gaataaggaa agaaacgcag atgcactact 20580 tctattataa taagcgcttg atttctttta atcccctgag tccaattatt ttcttggcaa 20640 atttaagggt actgactgct tctgtgaatc tattgttaca cttgataatg gatctgagtt 20700 gggggtaata tttgcctatc aatttggata cttaaaaatc tctctctctt tcttcatata 20760 ccctctatct cacaactttc catttaatga gggaagtgaa tttctttttt ctgcccctct 20820 ttcctactgc ttctagaata aaagcataac aggacgacag gagtggagat gagaggaagc 20880 atttccaagc aatgggaaag tatgatgaga gtcgtgagtt ggtagaatgg gggtgagtaa 20940 ggggtggggt gtggatggat ggaagaggat ggaggaggaa gcaggtcata tgatcaggct 21000 tcaaaggcct tccaatatct tgttttcaga ggctttgtag gtcttcttac agaactttga 21060 gggtattttt cctttgtgag caaaggagaa ccatagtaaa ttttgaggga tgggctgggc 21120 gcggtggctc acgcctgtaa tcccagcact gtggcaggca gaggcaggtg gatcatttga 21180 ggtcaggagt tcgagaccag cctggccaaa atggtgaaac cccgtctcta ctgaaaaaac 21240 aaaaagttag ttggatgtgg ttgcatgtgc ctataactcc agctactcgg gcagctgagg 21300 cagcagactt gctggaaccc gggaggcaga tgttgcaatg agttgagatt gtgccactgc 21360 actccagtct gggagacaga gtgagactcc atctcaaaaa aaaagaaatt tttttttttg 21420 gagggagaaa atatatgatt agattgtttt tgtgttggtt tgcttgtttg tttttttccc 21480 caagcaaaaa atcactttac tgcaataggg aagacaaata ggaaggggaa gaaactagag 21540 acaggaacag cagtttggag gttctgcaat acagaagccc agatgtttgg cttggactta 21600 gacactggga atgaaaaata aatgatgaat taaaaaaata aaatatttgg gaggtatact 21660 tgacctgacc ttggtgcttt ttcaaatgag aagagaaaga tagagacaga tgaaaaagtt 21720 aagcaaggat gactatgatt tcccatagtg aatgtctgga tgatgatatc attaaatgaa 21780 aatttttaaa aaggcagatg gtagaaggag atttgaagga aagacaagaa atttgtttgt 21840 tttggattta cttgtagaat gacctgcaaa attttgtcct aataaatgtt aacaagtggc 21900 tttccataca aaaaccaaac caaacaaaac cctgatgtaa accaaaaata aaattctgag 21960 gccccccttc accatctgaa tgaacttcct cctctgcaag ggcactctta aaatttaaca 22020 tgaaagactg gttcaggtca tgacgggaag tgggggtcgg acaggcctca ttatgcctct 22080 ctggcattaa catcaacaca gaccttaagt ctgttaagaa gcatttacaa tctattctct 22140 ctgaagcctg ctacctgaag gcttcctctg cacactgaga actttggtct ccacaatcct 22200 ttatcttaag ccagacattt cctttctatt gatcccaggt ctttagataa actcaaccaa 22260 ttgtcaacca gaaaaatttt aaatctatct ataacctaga agcccccact tcaagttgcc 22320 ctgccttttt gaactgaacc aatgtatttc ttaaacttat ttgattgaag tctcatatct 22380 ccctaaaaac caagctgcac cccaaccacc ttgggtgcat gttcttagga tctcctgagg 22440 gctgtctcct gagggccaag atcactcata tttggctcac cataaatctc taaatatttt 22500 acagagtttt actcttttca tcgacactga tttatattgg atttccaatg gtgtaatatt 22560 catgccgttg gccgatttca agctagctgg tagagatatg caaacaaaca aacaaacaaa 22620 aagagatatg caaaaacaac tcttgtgagc ttgtgtgctg gctccaacac accatgctgc 22680 tggatattct accaggcagc atagggcgca cggtgggtga gcctggcaag gagacaagag 22740 ctggaggtac agggcttaaa tgaagccaag ggcgtgaata atattgatga ggagagcaag 22800 tagaaatgaa aagagaaaac cagtattagg agatctataa agaaacaata gtcagaaaaa 22860 gagtggttaa agaggcagaa gaatcagaag agaaccagat tcatgaaata aaagagagga 22920 atcagtttca ggaaggaaag tgtggtcagt agtgtcaaat accasagact gaatagycta 22980 aggatttaaa agaggtcaac agcwccacca attaagcggt ttttctgagc ttaagatatt 23040 tttctttatt tttctatgtt aaaaatattt tataaagaaa aagaaaaaaa gagatcacat 23100 ttgccccccc aacccgtccc tggcctctct ctctctaagg aagtcgttaa tgaatttggg 23160 cagaatagat tcagagctgt gagagtgaaa atttgactgc agtgggtgag gtgttaataa 23220 aagggagggg gtagagatga aatctttcca gcagtttggc tcttaagggg atagcttgaa 23280 ggaaagaggg ttggaagggt ggtttttaaa aagaattatc actcactaat caactagaaa 23340 tccagtggaa tatgcagtac ttgtctgtaa tccagcaggc taactttttt tttttttttt 23400 ttttgagacg gagtctcgct ctgtcaccca ggctggagtg cagtggcatg atcttggctt 23460 accgcaacct ctgcctccca gttcaagcaa ttctcttgcc tcagcctccc aagtagctgg 23520 aactacaggc atgcaccacc atgcatggct aaattttttt tgaattttta gtagagatga 23580 gtttcaccat gttggtcagg ctggtctcaa attcctgacc tcaagtgatc acccgcctca 23640 acctcccaaa gtgctgggat tacaggccag agccacagtg cccagccaag gctaacctct 23700 tgatcccaat gacaaacaga acaaacatct tactcaagtc cagaagcaat aataattttg 23760 aatcttgctt gcatgtcaac aggagccaca ttaatacaga agaggatcac attggtccaa 23820 ttaaattgaa ttgattgaga gcctctgcaa tacacggtct actgcacaaa taatgatggt 23880 tcctggtaca tttttatttg accattgatt gctcgatttg tttctgtgtc taactgtgta 23940 ttggaattaa gctgactcaa tttgaactgc aggtcctttt atccctctta ttttttattt 24000 atttatttat tttattttaa ggcatttctc actctccaga aaatctctaa gatttcagct 24060 attaggcatt tgtctttttt tttttttttt tttttttttt tgagactgag tcttgctggg 24120 ttgcccaggc tggaatgcag tgtcacaatc tcggctcatc acaacctcca cctccctgat 24180 tcaagtgatt ctcctgcctc agactcccga gtagctgtga ctataggtgc aaaccaccna 24240 cgcccagcta atttttgtat ttttagtaga gacaggtttt cactatgttg gccaagctgg 24300 tctcaaactc ctaacctcaa gtgatctgcc cgccttggcc tcccaaagtg ctgggattat 24360 aggcatgagc caccgtgccc caccaggtct tcttaatcat aacaaattca tcttaaaaca 24420 tcattttaaa atatattatt ttttattctt cattgacttc ctgcattact ctattttttc 24480 agagtttcta gcaaccagta tcattggtat tttaaacaat gtgtacatgt acatttatgc 24540 agatgagtta acatatatca aagcaacctc caaacaatgc catttaggta atctccaatt 24600 taaaagcctc aatagaatga taagattgag cttttctgta gttccatgac ctccagcaga 24660 gtctgcaagg ccacagctgc ctgaaggttg attctgtaat tagaagatgc cagggtcatc 24720 tcagaataga acctcaagcc acccaggcta catttacaga atcagcctct ccagaaaaac 24780 agcaacaaag gagggccttc ctatgtattt ggaaggagtc acctagagga gggacttggg 24840 gttttggtgt tggtgtgggg ggcagggatg ggatggggag ggggaagctt attgaaatat 24900 actaaaagac aaaccaacct aagggctgga gggaagaaaa ttcacacttg taagcttctt 24960 ttttaagggg catctcttag gctctagctt ttgagattca gtatatatat atttttgagt 25020 cttgctctgc tggagtgcag tggtgtgatc tgggctcact gcaagttctg cctctcaggt 25080 tcacaccatt cttctgcctc agcctcccga gcagctggga ctacaggcgc ctgccaccat 25140 gcccagctaa ttttttgtat ttttagtaga gacggggttt caccctgtta gccaggatgg 25200 tctcgatctc ctgatctcat gatccgcccg cctcggcctc ccaaaatgct gggattacag 25260 gcgtgagcca ccgcgcccgg cccagtattt ttgttttatg aagatattac atttgtaaag 25320 tatgagcttg gtgtcagcaa acttatatcc ctgtgtacaa actggccgaa tcacttagcc 25380 actttgggcc aaatcactta gctcttctaa cagtaagaaa tcaacaagaa aaataaacat 25440 ttcaaacatt acaatgtgtt catgtattca ctgtggggga tgaccagatt ctcgaaacca 25500 caggttgttc ttagtgaaac aagtttgggt tggggccata gacttgtgta tttagaatca 25560 atggctgttc tctctctggg actttgattt ttttcttggg ctcatcccct ttttgtagta 25620 tcttatcttt gtcttatttg tataggactt aactgttccc attcccttat tagagcaatc 25680 taagtgatta ccttcatacc ttttggaatt atatgcttca aaattccaaa aagaatgatt 25740 ttgggctggg cacagtggct cacacctata attccagcac tttgagaggc tgaggtggat 25800 cgcctgaggt caggagtttg asrccawcct ggccaacgat agtgaaaccc cgtctctact 25860 aaaaaatata aaaaattagc cgggcatggt ggcaggtgcc tgtaatctca gctactcggg 25920 aggtggaggt tgcaatgagc ccagatcgca ccattgcact ccagcctggg caacaaaagg 25980 tgaaactcca tctcaattaa aaaaaaaata atgattttgg tggtcgactt caaataggta 26040 ggagaagaag gagagaggag atggagggtc asggagatct aattactctc taaaatcatg 26100 ctaggaaaga taacaccttt taataacact ctctgctttt ataacatcat tctgccaagg 26160 agctcaaagg tttcaacama gttcactttc agaaaacccc tttgaggaag acagaatata 26220 catcttctct ccmttttaaa gatgaagaaa caggccgggc acaatggcta atgcctgtaa 26280 tcccagcact ttkggaggct gaggccasar gatcgcttga gctccaragt ttgagaccag 26340 cctggataac atggcaaaac cctgtctcta caaaaaaaat acgaaaatta gatgggtgtg 26400 gtggcatgca cctgtggtcc cagctacttg ggaggctaag gtgggaggat cgcttgagcc 26460 cagggagtca agtctacact gagccatgat tggatcactg cactccagcc tgggtagaca 26520 gagcaagacc ctgtctcaaa caaaatgaat gaaagagaaa gaaagaaaga gtgagaggag 26580 aggagatgag gggaggggag ggtagcaggg agggggggag gaaggaagga aggaaggaag 26640 gaaaaaaaga tgaaaaaaga aatacgcaac atgaaacaga ggcagaaaga ctttacgtaa 26700 attgctcatc atgtggttgt caagtttgac cccaaaaccc aatttattga ccaaggttat 26760 tctttgactg aggcaagggg gtccgctctc ctgggccttg ggctttagaa agctcatctc 26820 tggcctttct gagatccatc cctttctttt tatttttctt gacacggagt cttgctctgt 26880 cactcaggct ggagtgcagt ggcatgatct cgactcactg taacctctgc ctcccgggtt 26940 caagcgattc tcctgcctca gcctcctgag ataacaggcg ctcgccacca catctggcta 27000 atttttgtat ttttagtaaa gactgggttt catcatgttg gccaggttgg tttcgaactc 27060 ctgacctgag gtgagctgcc caccttggcc tcccaaagtg ctgggattac aggcatgagc 27120 cactgcgccc agctcagatc catccctttc taagggcaaa cagtccatgg tgcaaagggg 27180 ccatgccacc cagagttatg agtacctggg actccagaat tccttgcctg gtggcctcca 27240 catgcacttc cagggcctgc ttgggcctct tctatgggtc tgtcctgagt gttgatagaa 27300 ccactgatgt gagtacctgg gcttgagccg tggcctggag atcctgttga ctgtagcatg 27360 gagggggctt gtgcagctga atgtctgyat gcaggtggtg ggagttctgg aatatgatgg 27420 agctggaggt gggaagagaa gtaggcttgg ggcagctctc tcatgccacc tcattctggc 27480 caaaactcag gtcaaactgt gaagagtcta aatgtgaatc tgcccttcaa ggtggctaca 27540 aaggtatctt tgtcaaggta ggagaccttg tggcctccac gtgcacttcc agggcctgct 27600 tgggcctctt ctacgggtct gtcctgagtc ttctatgnaa tctgtccttc agggcagatt 27660 catatttaga ctcttcacag tttgacctga gttttggcca gaataaggtg acatttagtt 27720 tgttggcttg atggatgact taaatattta gacatatggt gtgtaggcct gcattcctac 27780 tcttgccttt ttttttgccc ctccagtgtt ttgggtagtt ttgctcccct acagccaaag 27840 gcaaacagak aagttggagg tctggagtgg ctacataatt ttacacgact gcaattctct 27900 ggctgcactt cacaaatgta tacaaactaa atacaagtcc tgtgttttta tcacagggag 27960 gctgatcaat ataatgaaat taaaaggggg ctggtccata ttgttctgtg tttttgtttg 28020 tttgtttctt ttnnntnnnt nnntgttttt gtggcctcct tcctctcaat ttatgaagag 28080 aagcagtaag atgttcctct cgggtcctct gagggacctg gggagctcag gctgggaatc 28140 tccaaggcag taggtcgcct atcaaaaatc aaagtccagg tttgtggggg gaaaacaaaa 28200 gcagcccatt acccagagga ctgtccgcct tcccctcacc ccagcctagg cctttgaaag 28260 gaaacaaaag acaagacaaa atgattggcg tcctgaggga gattcagcct agagctctct 28320 ctcccccaat ccctccctcc ggctgaggaa actaacaaag gaaaaaaaaa ttgcggaaag 28380 caggatttag aggaagcaaa ttccactggt gcccttggct gccgggaacg tggactagag 28440 agtctgcggc gcagccccga gcccagcgct tcccgcgcgt cttaggccgg cgggcccggg 28500 cgggggaagg ggacgcagac cgcggaccct aagacacctg ctgtaccctc cannncnncc 28560 ccaccccacc cacctccccc caactcccta gatgtgtcgt gggcggctga acgtcgcccg 28620 tttaaggggc gggccccggc tccacgtgct ttctgctgag tgactgaact acataaacag 28680 aggccgggaa cggggcgggg aggagggaga gcacaggctt tgaccgatag taacctctgc 28740 gctcggtgca gccgaatcta taaaaggaac tagtcccggc aaaaaccccg taattgcgag 28800 cgagagtgag tggggccggg acccgcagag ccgagccgac ccttctctcc cgggctgcgg 28860 cagggcaggg cggggagctc cgcgcaccaa cagagccggt tctcagggcg ctttgctcct 28920 tgttttttcc ccggttctgt tttctcccct tctccggaag gcttgtcaag gggtaggaga 28980 aagagacgca macacaaaag tggaaaacag gtaagaggct ctccagtgac ttacttgggc 29040 gttattgttt tgtttcgagg ccaaggaggc ttcgggaagt gctcggtttc ggggactttg 29100 atccggagcc ccacatcccc accacttgca actcagatgg gaccggaggc ggtgttaaat 29160 ggggagacga tgtcctagta cgagctctgg tgaccccagg actctgcgct gctgcgcttg 29220 ngggcttgcc cgacggtgga gaccggggag catctctggg cgtggagacc cgggcgcagt 29280 accccgggct cagaggggtc gggggttccc ggngcgtgct gagggcgctg ctgccgggtg 29340 gggagagctg caggtccggc accgagncgc tgctttgttc ggagggccct gagctggcnt 29400 agnaaaccct tctggttgca ggtcggccag tacctacgga gacaaatgcc agcactgagt 29460 cttcactcgg ttcttaagaa gctggtctgt tctgacctgg gaattggcta tatgctcccc 29520 gggactggag cggcacagtc ccggactgtg aatccgggaa ctcgagttgg aggtgtccca 29580 aacggtccgt ggtgctattg ctcactagag gccttgggtc tttgntttga cctgaggggt 29640 agggaggtcc tgcctacagt ctccgtgcgc tcagctgagc tggtgtccct ggcgcagagc 29700 gcggacgagt tttgtttcct tttctttttc ttttttttct ttcctttaag tctcggtctg 29760 tcgcccaggc tggagtgcaa tggaacgnat ctccgctcac tgcaacctcc gcctcccggg 29820 ttcaagcgat tctcctgcct cagcctcctg agtagctggg attacaggcg cgtcaccaca 29880 tccagctaat ttttgtattt ttagtagaga cggggtttca acttgttggc caggctggtc 29940 tcgaaccctc gacctcaggt gatccaccgn gcctcggcct ccccaagtgc tgggattata 30000 ggcgtgagcc accgcgcgcg gccgagtttt gtttctttta aaaacaagac ttaggagagc 30060 ctgcggagac ccggaggtgg ggtgcccaat cctccctctc ccacgttccc tgcagcccca 30120 tcttccagac cgttgctgct ggtctctcgg ggcagcttct gcctgggcgc agatggggaa 30180 gctgggccga ggtggtgccg tggaatgacc gggagtaacc ccggcgggcg gcgcagaact 30240 cggagctccg ccgcggggct gggctgggct ctgccgtgag ggtgggggtg ctgggcgcgc 30300 gggctgcggt ggnccccgga gactggcccg cagncgcctc ctggccggag gacctaggaa 30360 tcggccggct ctactaggtg tctttgctcg cggttccgac tgtgaatccg gtgaagaccg 30420 gtggttgcag acggggagga actatgaggt tgaggcgaaa gcccgttttg tttttttttt 30480 tttgtttttt tggttttttt ttttgttagc gtgtttgcca actcccaggc cattggtaaa 30540 gcaggaaggt tcttggggcg gcggacggtg ccagggttat gtgtaggtgc ctctttaggt 30600 atatctttta tcaaaaagaa gcaaagaaat aagattaaaa ataaacaaag aaaaaagttg 30660 tctggcactg gcagtaattg gcctgccttt gcagcactga taccattagc ttttaaaatc 30720 cgacttttca ttgacacttc aagaagagaa tgggtagtat atacacattc atctcatagt 30780 ggacaaattt catatttaaa aaaaccttct gggtactgaa atcagcaagt cacttgccct 30840 ccatggccga atccctgctt cccacgaaga gaacctcaca aaaatttccc ccaagttaaa 30900 gagtggaatt ttcttgattt ttttnttctt ttttttttaa cggccgtagt ttagaanccc 30960 agacttaaat tatgatcttc ttttcaaaca aaacttaaag tccttaagtt ttcatctccc 31020 cttttatttc aacctattct tctcatacct accacaaaaa taatggaggc tttctgttga 31080 gaaactttcc gtttctgttg agagtatcat tctcttgaga aactttctcc taaatcagag 31140 aaagtatgga agcatggaaa gtattcctga gtagaacctc tacagatatt acaatatttt 31200 tcaaatacaa agtttccatt gtcagcctgt ttcccaagtg cttccacaaa ccattaaata 31260 attccacaaa ccattaaaat aattaatgct agggaatttt aggaaaacat tggtttacaa 31320 tcagaaggac cggggaagtg ggtcttcagc cttcacgatg actacaagcc atttaaggga 31380 ctagaattgc tactgttgtc agagcaattt aggagtctgt atttgagcac ccgcatagtg 31440 ttccagaatg acatatctga ctgtaacctg gacacgtgtg atatgttgtc tcccctgcag 31500 atgagcattt gaaatctcaa ccctcgtatt tctacgagtg caggcctata atggaccctg 31560 ggcacatttt tttttttttt gagatgcagt ctcgctctgt ttcccaggct ggagtgcagt 31620 ggcacgatat ggctcactgc aacctccacc tcctgggttc aagagattct cctgcctcag 31680 cctcctgagt agctgggact ataggcgcac gccaccatgc ctggctagtt tttgcatttt 31740 cagtagagac agagtttcac catgttggcc aggatggtct cgatttcctg acctcttgat 31800 ccacccgcct cggcctccca aagtgctggg attacaggcg tgaggcaccg cgcccgaccc 31860 ctggacacat tttgacttag aacatatttt cggtttgtgt gagacagtgc attagtgcag 31920 gattggaaaa gagtgatcag gaattgattg ttttcaagga ttggttcctt ctgctcaagg 31980 aagtcccatt gtaaacataa aaaaatgaat gaaactgaag aagttcagtg acttagcttt 32040 ttattatctt ctgtagtact tacctttttg gagaggagtt ggttgggata tttttccatt 32100 taaatttttt ttttaaaggg atcttctctc ccgtaagccg ggatacttaa gctatatatg 32160 tagtggctac aaattaaggt cttcactgtt ttcattttta gctgctagaa taagtgaaca 32220 ttaccttaga tagactcttc taattatgaa gatatctaga tgtctagaaa atatcaaaat 32280 gcatgtggtt tttgcatttc taaaatactt ttaaaaccaa atactttttc tttttttttt 32340 ttttctgaga tggagtcttg ctcttttgcc taggctggag cgcagtagaa tgatcttggc 32400 tcactgcaac ttccgcctct caggttcagg tgattctcct gcctcaacct cctgagtagc 32460 tgggattaca ggtgcgtgcc accacacccg gctagttttt gtgtttttag tagggacagg 32520 gtttcaccat gttggccagg ctggtctcaa gctcctgacc tcaagtgatc tgccagcctc 32580 agcctcccaa agtgctggga ttacaggcat gagccaccac acctggcctc aaatacattt 32640 ttttaagtat ccagatatta aataaataat accattatag tagttgttat ggtcatttac 32700 tctagcatca aatgttaaaa gatcattctg aacacttgtt ttgtttatgc tgagagaagg 32760 cctactccaa aaaatgcaac catcttcgta tctgcatgtg gatacaacca tgaatggcca 32820 aagttattgc agtgttgaat agacacttat atagcactgt gtggcaagta ctgtttgaaa 32880 tgtttttcac gtgtttattc attgtattta ttttgagata gggtcttgct ctgttgcaca 32940 gggtggagtg cagctgcaca gacaaggctc actgcaacct cagcctcctg cgctcacgtg 33000 atcctcatac atcagcctct taaggagctg gggccacagg cacgcaccac tgctcctggc 33060 taaattttta caattttttg taaagacaag gtctcactat cttgctcaga ctggtcttga 33120 actcctgggc tcaagtgatc ctcccacatc agcctcccaa agtgctggga ttacaggcat 33180 aagccactgt gcaggtcatc aaatgctaat tgaattttca caacaaaacc atttattgtc 33240 cctagtttac aagattaagt aaatgagaag ctaatttttc tctggctata taccttgcaa 33300 gaggcagagc taagacttga acccagccag agttctttaa ctccagcact aacatttcag 33360 ctgctgcaac cagggagctt ttcaaggatg atcaccacat tctctacatt catctgctat 33420 aatccttatc agaatctaca gcctgtatca tattttcctt gttgctgtga gtgggtcagc 33480 caaattctct ttaacttgaa accttggttg cgtagggatt gcaacatcct ggaaagaata 33540 gaataaaatt tactcaactc aattttttac ttggttcata atgaaaacta tactattgct 33600 tcagtcagat gtttgcgaat agctgtgtga tctcaaaatg ttttcctatg tgatctatag 33660 naaaatggaa tgatagagta ttaggctgta agggcctaag anaacaaagg aaaaagagaa 33720 gtgaactgtt agtttagttg taaaacctaa ctttggtgaa ttgtaaaaat ttgttataat 33780 acaatatgat tcttgcttgt cctgtccttg atgaagttgt ggaccttttg aaataagcta 33840 tttctctgtt actgctgtta ctgtttagaa atcaaattta gttttttctt aagatatacg 33900 tatttttgga agataaacac agtttcaaag tctgccttgt tggctgggtg cactggctca 33960 ttgttgtatt cccagcactt tgggaggcca aagcaggagg atcacttgag gntcaggnag 34020 ttncaagacn cagncctggc aaatatggtg aaaccccgtc tctactaaca atacaaaaat 34080 tagctgggcg tggtggtggg tgcttgtaat cccagctact gggattggga ggctgaagta 34140 gaagaattgc ttgaacctgg gaggcggagg ttgcactgag tcgagatcgt gccactttac 34200 tccaacctgg gcgacagagt gagactccgt cttgaaaaaa aatgtctgcc ttgtaaaagt 34260 gaaataggat gagaaagtgc ctttcttatt aattggtgta atgaattaga aataaactct 34320 ttgaagacac ctcttggtaa aaatagttac atttactgtt gatttatggt atgttggata 34380 tgtttttaag ttttccgtgt aataactcag ttcattctca tgagtgaaat aggtgctttt 34440 attgtcttta tagatgggaa actgaggtat aggcaggcta ggtacattat tatggagttc 34500 gtaagtagtg gagctgaagt cgcatcccag acagtttggc ttccgtgagt ttaccaatct 34560 catggtaaag actttgtcag actatcaaag ttttgacaaa tgaaatatta gcaaaaggcc 34620 aaaagggatt ctctattttc atttgagtat cttcacctga aaatagttgc ctgaataagt 34680 agcctgcata gaaaggtaca ttttagaaat acttgaggcc agagaatgaa aagcttacat 34740 aaaattgatt tccggtgggg ccttcagtta ctctccattc tacgaagacc acaaatagca 34800 ttcaggcaaa gagcatttat tccaacaatg gaggagcact ggatttggtt cctaaaacaa 34860 aataaagttt gaaatcctgt ctttcccatg ttgaaaacaa agttggtaca aaacccttta 34920 gcttttgcaa acctccttta agacccgatt taaatgcytc cctcctcatg aagctcttct 34980 ggatccactc yttcccatca ctaagttgaa agtaagatcc ccttctcttt acttccatta 35040 gacttggatt acagcactct ttgtatcatg tatttaattc tgttttttaa ttacagttaa 35100 catttatttg tcttcctctt gagtgtatgc ttctctagag gaaggtcttt gattcattct 35160 cccctggcct taattcatcc cacttaatat ggaaaaaatt taataaatgc tgacttgaat 35220 aagtccaaca aggagaatgg gaagctcatg tttgcttcct gtcttctaaa agactactta 35280 agataacagg gtaatcacag aaaagcatta gaaatagagt tatatgagaa acaactgtag 35340 ttaaggctag gtttatgtta gactgagaaa ttttagtgca tacttaagtt atttaggcca 35400 ggttactttt tgtagaacaa acatttcagt ttcgctcagt ttcatttccg tttctgnagg 35460 cagctgtgat ttaagaaaat gctctagtct gtggcattcc atattcaagt actttgagtt 35520 gtatattaat ttatttttgt taataagagt gacatgactc actaagtaat ttagagattt 35580 aaacactttt ttaaaaaaca gtaacttcat atgcattgga tctattcttc tataaagtct 35640 tttcttgggg ggtgtttgtt taaaattccc cggtgttttc tctgccaaat ccaacttcca 35700 agaagcattt ggaagtcaaa acattttatc tggttagtct taaagtccag atattttgtg 35760 atagctggta tttagtttat gatatttccc aggaagaact ttttagtagt tgaaccattt 35820 atgaaagact tccttgaagc taccttagag agttgattta gttcttccta aataagtaaa 35880 tagaatatta gtattaggac atcttggagt atagatgcaa atattggtga aaaagaacat 35940 ggatatcaga gtcaaattaa tgtagattgg aattctggtt attactaatg gatatctgac 36000 attaggcaag ttgctgatca ctctttgcct cagtttcatc atctgtaaaa taggtatttg 36060 tgtttgtgta taatgtgaac cgtataatat aatgcttggc ctatagtgaa atttattcat 36120 acgagtgttt tcagtgattt taaaagcttg ctttaggccg ggcgcgatgg cttctgccta 36180 taattccagc actttgggag gccaaggtgg gcggatcatg aggtcaggag ttcgagacca 36240 gcctgaccag catggtgaaa ccccgtctct actaaaaata caaaaattag ctgggcgtgg 36300 tggtgcaagc ctgtaatccc agctactcag gaggcttagg cagaagaatc gcttgaaccc 36360 aggaggcaga gattgcagtg agccgagatg gtgccactgc acagagcgag actccatctc 36420 aaaaaaacaa aacaaaacaa aacaaaaatc ttgctttata gtttacttcc acatcaaatt 36480 gtctttatcc catgttactt gcattgatat cccagacatg aaaagaaaaa aagatgataa 36540 caatgacagt tattaaatta ggttccactc ttattctaga tcaccaattc atattactat 36600 tcagacttgg aacattaaat tttagttaaa ctttttttca aatatgcata taattgtcag 36660 tggttactat attttgggga agagattgtt gacttctttg aagaaagata cggattttct 36720 cttcagaara aatacacatg ggctcatata atccaaattt tatgtgtaat tacagggtgt 36780 tcatgaatgc ccacaaatcc attaagccat gtgaccttgg acaagtcatt ttacttttct 36840 gttttttagg ttgttggtct gtaaaatgat actacttgac ttttaaagag cccttcaagc 36900 tcttatgtcc tctaacccca ggtctgtatt cagaagaagg ggtggtcctt taattagagc 36960 catctagaga tctgaggaac atgctgggca ttagtgtaac ataccatgtg gattttgaga 37020 ggtaaagaaa aaataaccag ggaatgcctc agagcattcc tgatcagatc gatgacagaa 37080 gaaaggaatg agagggagga gaggaagctg ttgaaatttc ctatttacct gctttgagtg 37140 aatgaagatt tgaatcatag aaccagaagg ggttctcatc tgaaatgcaa aggaaggagg 37200 agttggttta attcaataag tttcagttga gtaaacatga tttagtgaga tactgttctt 37260 gcttctgact caccatttgg aaaatctctc taaaataaaa ttggactctc catctcggac 37320 atcattttgg gtgtaggttt tgcttttttt tttgagatgg agtctcgtta tgttgcccag 37380 gctggagtgc agtggcgcaa tctcggctca ctgcaacttc cgcctcccag attcaagcag 37440 ttctcctgcc tcaacctcct gagtagctgg gactacaggc gtatgccacn catgtccggc 37500 taatttttgt atttttttaa tagagacggg gttttactat ggtggctagg ctggtcttta 37560 actcctgacc ttgtgatctg cccaccttgg cttcccagag tgctgggatt acagatgtga 37620 gccacagtgc ccggcctaag ttttacttct tataatggac tcctgttaag ccaataggtg 37680 atgaaaggaa accataacca actcttcagg ctcattcatc cttcaagaat agcatgctag 37740 tacccatcct aggaggagaa ttggactata cctcatgagg atagtttgaa gtatctcaga 37800 agaccctcac tgggggtagg tcggtaagac acaaagcttt ctaaagcact gtaccaaatt 37860 tgttgtttga gagatcataa caaattagaa gtggaaagaa gaaggagtaa aaggaagaag 37920 aggtttctgg ccaggagcag ggagggggaa ggagctgcta ggaagatgtt tggttgtcat 37980 atccctgttc acccttgctt tgcaaaattc ttgtaggatg ccaggttggg agtaattgtt 38040 tttcacaaga gtcaaaccac gcttgttttc ttgaagaagc aagtctttgg agggtggtgg 38100 cttgaaatct gttggatctg gtatttaggt gatacacttt gacataaagg ncaacactga 38160 tgcaagcagc agctttcctt ggaaaggcag ggagaaagtg aaggcccaga ctgatgagct 38220 tacactgacc tgcagaccct tctccattcc caggcatgtt tggtggcaga gtttacctta 38280 gttggggtta ggctgttgtc tggtactgtg agagagaagg aagaagaaga tatgatatta 38340 acaacaacaa tacatattta tatttgaaaa ttaaatgcac taatacacct atagtgctat 38400 taaacaaatt tatttgttca gcaaatgttt gttaaacacc aatgtactgt ggaaagtact 38460 aggtgctgga cagaggtcaa aagacttttt aagaatctgc caccattaat gatctctttc 38520 tgcttggcat tcaaggctct ttgaaataag actgtgaccc actttgatag ttttgtcctg 38580 gattataaga cacatgctcg aaggaactat agctggtttt ctcaccagac tgattaacat 38640 atagtatggt ttggtacctg ttaaatgagt ctctctctac aggttttcat cttctacttt 38700 aagaacccct tcctggcatt gtttggctcc tcattgttct ggaatctcat gtccatctca 38760 tgtatttgcc ttggttcaca cttgatcttc tgctcatatt cctcacctac tgaaatttta 38820 cccatcaacc agaccgtggg tgatggaaca cagcatgggc tagttcttct tatatatgat 38880 ctcatttaat tttcactgga actctgagat aggtagcatt tcagcccact caagttgact 38940 caaatttttg taatcatcaa tatattttaa aataacttta tatttgacct gtaattggaa 39000 aaccaatatg agttatcata aatgaaaggt aattttaaaa ataattagga tgaagacaaa 39060 ttatttttct cacagtctat gtataagata aactattggt tcccaaaggc cccagctgac 39120 aatgagactt ctcttacttt gttgaaaagg gaattagcaa gcattaaaga ggtgtcaaaa 39180 agaagactaa acaaaagctt actccttttt ttttttgaga cgagtcttgc tctgttgccc 39240 aggctggagt gcagtggcac cacctcggct cactgcaacc tcttcctcct gggttcaagc 39300 gattctcctg cctcagcctc cctagtagct gggattacag gtgcatgcca ccacacccgg 39360 ctaatttttg tatttttagc agagatgggg ttcgtcatgt tggccaggct ggtctccaac 39420 tcctgacctc aggtgatcca cccacctcgg cctcccaaag tgctagaatc acaggtgtga 39480 gccgccgcac ccggccgctt actccttaat gtactaagaa tgttatatat aggctgaaga 39540 agtgctgaaa agaaccatat tttctcatga tgtggttcaa tgtttaatac tgtgcttgtt 39600 catctcctaa aatcctctga atatcactta aattcatcct gtgtaactct cccacatttg 39660 gggaaatact gagcttgcct attattatat tagccccata tttcagatga tgcacctgag 39720 ccgaggagaa gttaaataac ttgttcatgg ttccatattt ggctaatggc agagccaggg 39780 attcaaactc ttgtctctct gactcccagg tttgtgcttt tcccacttgg ctgaatttct 39840 catgctacct cctccataac acctttccta gaacttttaa ggaatgcttc ccctgttctc 39900 tcatagcatt ttaatgtaag ttgccaaagt gtcccagttt gcacccctac cagcaatgtg 39960 ggagaaaata gcaacatatt tttgatgttg gggtctagtg ttacggtttc ttctgctctt 40020 tgggatgtat attccatggc tactgaatac ccaagtccca aacagttttc ttagactcag 40080 aaggtcatcc acttcagctt cttcatcaaa aagacatatt ttctggctgg gcacggtggc 40140 tcacacctgt aatcacagca ctttgggagg tggaggcagg agaattgctt gaacccagga 40200 ggctgaagtt gcagtgagcc cagatcgtgc cactgccctc cagcctgggt gacagagcga 40260 aactctgtgt caagaaacaa aaaaaaggac ttgttttctg ttccattacc cacagtggta 40320 gaatggcgtg ctaaatttat tctccagctg ccattaactg caaattaaaa tcttagtctc 40380 ttgcctcttt aatccaggct tcttcatact ataccagaat ttaggataac tattacagtg 40440 ccctttatag gagagaaaga agaaattgtg tctgtagatg tctgttcctt tcagcttaaa 40500 atggacactg aaatgttaaa tattggactg gcctcattta tttctcctgt ctgttggtcc 40560 aatttgaatc ttaaggcgtc tttcaactgg aattttttgt ttctctcaac taaaaattgt 40620 tctttgtaag tttgaatcag aacaaaatcc tgaatgttga gggtttccta aaggctgttt 40680 ctttatgcaa aagcctgaaa cccgatgttg atgttggctg cttaaaatta actgtgaatc 40740 aaggcagggt ttttattttt attttttttt tactttaatg attgtgttaa ttatagtgaa 40800 aaccttgagt tcacgagaaa gaaagccttt ggtcaagtat tgtttattaa gttgtcagtc 40860 ttgttgcagg atttgcaaat ttagtggaat tagtgccatt tttcagttta caattccagt 40920 cacatttcac atgatcagag catggctttc ttctctgtgg agcaaataga gggctgtctg 40980 acacttggtt ccagtggctt ccattaagca gagtggatat gtccctggag tctgcagaga 41040 agggcatggc actctgaccc cagatggcac tccgttttgg gacattgtcc aattctagtt 41100 catagcatat gtgaccaaca ccagctctca cctgatgtaa acacttagcg cgttgttgct 41160 tgggggattg gattgtgtga atttttcaaa actacagttg acagaaggag gctaccaaaa 41220 atgaaaccca ataattccat ttttgggaat tattcccact tttgttccat ttttcccact 41280 ttgttctttg gcacacagaa tgtttgattt gtgaaaatct taataacagt agttttttct 41340 ataaggaaca ctcagaatct tgataatatt ggaataatac agatcctttt gtaggatcct 41400 ctcagacctc atataataga gttcatgtag tcaatattta aagaaaaaca cccttaagtt 41460 tttgtttttc agaatcacaa gtaagtggat ttaaacttgt gatcttattc ccctttcttc 41520 tcttaattta gtgaggcagc cagcgagagg gtttgttttg gttattctaa agaaggagtt 41580 tgcttgtaag ttttggaggg caagacttag actctgtgtc tctgtgcttg ccctggaact 41640 ttgattaaat tgtcactaac cgagttagct ggccctcgcc gggctgcaga aatagaagtg 41700 tcttgcacac atgacatatg actgtctcaa gagctggctg gtgaaaggac gttctggaga 41760 aggctgccga tactgtatga actagaanct ggacaagagc ctggagattg gataactcag 41820 tttggcgcaa gtaaagggaa taaaagtgtt aaggtggcaa aattgtatcc aggtgtttat 41880 aggctccctg agttcctgac ttgagcctat ctatgggttt agagttcaag gctctttacc 41940 agtgctgaca atcttatact ctaggttgaa cctccgggga aggtgccctt gcttgatggc 42000 atgtttacca ggggttctag agcctcaatc acagattctc tctagctcac atgaagttaa 42060 tgaaaatgaa tgtgcttccc tacaaattag agaggctttg aggaaaaatc agattaaatg 42120 cactcctgct tgaacttatg tttcttagaa cacagctgga aattttgtca cacaaacctt 42180 tactttcagt gacatttctt gactggtttg ttactgtagt gaatctgctt taactatctt 42240 ttcttatcgc tgaggtttta cttccattct acatgtgatt gtggagcgct gcgtcattgt 42300 gggttcagtg tagtggagag taggaagatg gtgagacaca gtagcttgtt gcacattgct 42360 taatttatca gggatcactg atgagttagt acactagaga agattgtagg tagagctgaa 42420 aagatggagg aattataagg ctcagatttc tctctttttt ttttttttta agatggagtt 42480 tcactcttgt tgtccaggct ggaattcaat ggcatgatct cggctcactg caacctctgc 42540 ctcccgggtt caagtgagac ttgatggtct cacttgatgg tttcctggct cagcctcctg 42600 agtagctgag attacaggca cccactacca tgcccagcta cttttttgta tttttagtgg 42660 agatggggtt ttatcatgtt gaccaggctg gtctcgaact tctgacctca ggtgatcacc 42720 tgcctcagcc tcccaaagtg cagggattac aggcatgagc cactgcgtct nggccaaggc 42780 tcagatttct aatagagatn ttctaatgga catagaggct ggaggaaatg ggatnggaca 42840 ngganaaact gagtncnagg tgccaaannn aacttgtagg gggccgggtg cggtggctca 42900 nncgnnccnt gtaatcccag cactttggga ggctgaggcg ggtggatcac gaggtcagga 42960 gatcgagacc atcctggcta acacggcgaa accccgtctc tactaaaaat acaaaaaatt 43020 agccgggcgt gttggcaggc ggctgtcggt agtcccagct actcgggagg ctgaggcagg 43080 agaatggcgt gaagccggga ggcggagctt gcagtgagcc gagatcgcgc cactgcactc 43140 cagcctgggc gacagagcca gactccatct aaataaataa atagataaat aaataaaata 43200 aaataaaaac ttgtagggag gtggcagtgt gctatggagg ataggtgcaa cctctgtgag 43260 aatgtagaga aaatagtata agtgagtggt gaggaccccc aagagggggt tttatagtaa 43320 aacaatggtc agaagtggca acaggatacc gtataatgct ttcacctcta ccaatgcact 43380 gggtactgga gagcgctcca gtttgctctg gaaaggccct ttctgtggac aaagaataca 43440 gaaaagagat tcctttaata aaccacccac tctgtgtccc acccttgatg aatacttcac 43500 tgtgaaattg ccagaattaa tcatggtaat agctactgta cacttacttt gttccaggaa 43560 ctggataaat gttttacata cattatcagt tctatttttt ggagaagata caggggctca 43620 gagtccctag ggttccagag ctggtgagta gcagagtcag gattcaaacc cagctttctc 43680 tgactctaaa acctcctttc ttcctgctga aacaattaac tcaagacaac aaaggagtta 43740 aggatttggg gagttttctg catggtagaa tagacccaaa ggaaaagaaa gaaagacagt 43800 gactaagatt tgggtttgtc tgccccacca aatgctttga gcactttcat aaatataaat 43860 ccttcaggtt gggagaaggt tgaacatctg aagacactga ttcttcagag atgtaatcca 43920 aacaaagtga tctttggtga tatggtcact aaaccattta tcccaaaatt ctcttggaaa 43980 accgtcctat aacagcaggg aacattatcc agccaagttt ttctgcaaat aaagggttgc 44040 tgatagaggc ttgcctgctt gtgtttctng tnagcntnca gggtgtttat gaattcacta 44100 atcccttccc ttcagatccc ttttattctg gtgttatgat tgtgactgna aaaaaattga 44160 ttttttttct atgacataga atgttgaaag gttgatttct tttctagagg aaagattctt 44220 tttttctatg tgctacatac cccccgacca gggaaaaggc aaatagtggt attgtttgct 44280 gaagtcttcc tttgaaggtt gcttggtgtt tgcttagtgg aaatcagcag gggaagagag 44340 gctatctctn aacattttgt tagangtttc ttcgtnagtt ctatagtgat gaaacaagga 44400 cttggggtac aggacagatc tgctttcaga aatcctggct cttgtgaggt ttagaagccc 44460 tgagaccatt tagctggtgg caacgggaat gttgagggtg ataaatagga tctttggttg 44520 tccaagtatc agtgacatga tatagatgga gttaaacctt taggatctcc ttatttattt 44580 gtttgtttat ttttgagaca gggtcttgca ctgtagccca ggttggagta tggtggcatg 44640 atcataactc actgcagcct caaactcctg ggctcaagcg atcctgctgc ctcagcctcc 44700 caaggtgtta ggattacagg catgagccgc cacacccggt caggatctcc tgtaaaatta 44760 tattgttgac aacatgaaga attatgcttc tcaaaagcta gttatagatt tgtacaatat 44820 tcatagattt cttgtttcag tttttacaaa ttcatagccc ttattttgaa aattagctat 44880 tagcaataat tttgtctagg aaattggatg tgtattcaag tgaaagaagg aagtacagtt 44940 acctattatc ttattgtaac taacaatcaa gtaagtgtga tgcatttggt actttaaaaa 45000 ctgcacccaa gttacagatt attggaatta ataaaattca ctggatctat atatttttaa 45060 acggacagtg tgatagcaga acctcttata gaatngatag aattcctctg gaatgattgg 45120 ataacttcat ttcatccttg acttttacct tggaggattt cttacccctt ttggcttctc 45180 aaatttgact attaaaatgt tgcctttaaa aataggaaca cagtttcagg ggggagtacc 45240 agcccatgac ccttctgcaa ggccccctaa ctcaaggtag tttccctgga actgtggttt 45300 atggaatgtt tcaggagtgt gaggaggtat aatttaaggc tgtcctagca aggataccct 45360 taaggataga gggcccagta gcatctggag gccagaaaag ttanaactga ggcagtcaga 45420 ttagcttcan ggctcaatta agctgatggg tcagcctggg agaaattgca ggatgactct 45480 caatatcccc tcccaccccc acagcagcca cgatctgtct gtctttaatc atgggtgcag 45540 tgaacctgtt ctttccaggt gtcttggcct tcagtaacct tgttaggctt gtccctgaac 45600 gtggctaccg atccaaagac acatgatcag agaggcaatt agagaacaga ccttttccaa 45660 agcaagcatg ttctgttggg cttagaagtt tcatgtccta atattatagg accctgtgca 45720 tctctctgga gatgaggcac atgagtcata tctgtgattc ttgcttttgt gtcaacatct 45780 catgaatagg caatcagagc tttggcacca atgtattttc agttcatatc tgatgtagtt 45840 aaatccacct cctgctttgt agtttactgg caagctgttt ttgatataag acatctagaa 45900 cactgtaaat atataacatt tttatttgtc tattatacct caattacgaa aaagacatct 45960 agaagcaacc tcatcaagag agatactgag gccgggcatg gtagctcaca cttgcaatcc 46020 cattactttg ggaggctgag gcaggtagat cacttgaggt caagagtttg aaaccagcct 46080 ggccaacatg ttgaaaccct gtctctatta aaaatacaaa aaagttagct gggcttggtg 46140 gtgggcacct gtaatcccag ctactccgga ggctgaggca ggagaatcac ttgaacctgg 46200 gaggcagagg ttgcagtgag ctgagatcac accactgcac tccaacctgg gcaccagagt 46260 gagattacat ctaaaaaata aaataaagta ataaaaaaga gagatattga tagctgttgt 46320 tggaaatttc aacttccatc tcacttctgg taactttttg gaagtttgtt gaacaaagtg 46380 gaatacacgc acatacacac acacacatac tctcttgttt gtttaaggtt taatgaaata 46440 gctgtcatat aatcactgtt tttgaaagag gagaattagt tgctatctgt acattttggg 46500 tatgtgaact atttggatag aactctgaga aatgcattca gaacaacaaa caaaatcata 46560 ggagaaatag ctaagtggga aggggcatat aagagttgtt gaaaaagtta tttcttgaga 46620 aaccagctct aatgctaggc aagtcacttg ctttggggga ggcctcagct tctctgtcta 46680 taagattgca gcaggggtgt agtgggaatg agtcttcaac attccaagag attttatcta 46740 ctaatacgac agtcaaatgg agcatgactt tgtggaagcc tctcctcttc cacccagagg 46800 ggccaatttc tctgtcccag tgagatgttg acacttgtat gatccctgct tggagacttc 46860 cctcttctgg aacctgccct ggctcaggca tgagggctga ctgtcaccct tcgataggag 46920 cccagcacta aagctcatgt gttggcagtg ttcttgcggg aaggaaaaag accagccagc 46980 ccatttgtta ctgcacaagc aaacagcttc tggtagctgt acagatacat gcactttctt 47040 tcctcactgt gtttccatag acagatttag tgctgtagaa gagtagaggg cagtcacggg 47100 aaggagttcc tgtttttctt ttggctatgc caaatgggga aaaatcctcc tatcttgtct 47160 ttttagtgtc atcctctctc cccttttctt cttctttata attctcatct ctcatctctc 47220 ctggaaatgt gcatgtcaag ttcaaaaggg cacaatgttt tggtgaggaa gaggtgggag 47280 aacacgtgcc aggtgctaac tagggtcatc atttccccct tcacagccag cttcctgtga 47340 atgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtat ttcttttgcc 47400 agcatcactg aatctgtctg ctgtctggta ttccaggttt tggtttaggg aaaagtaaaa 47460 gtaattttat aatcccagct gtcatttaag ccaccccntt tgtgggtagc atatggtcca 47520 ctctctcagt tcattgtcct aaagatgctt catcagaaag gaataacttc caccccgtta 47580 ctctctgtcc ccttactctg ctttattttt cttcgtcaat cctaccacca ccacccactg 47640 tttgaacaac ccactattat ttgtctgttt cccatccctg gtagaatagg agccccatga 47700 atgaaggaac tttgcttctg ttgttcacca ctgaatctct aaggtatgga acacacctgg 47760 catgtgatag gcactcgata aatatttgtt gtggctcatg ggcaccttgc agagttaagg 47820 ctgcagttgt ttgtggaatt tataagtggt aatgaatatt tatctactat tcctcttcca 47880 aggcgatcac acaataatca ggctttacac tatccagttc ttaggtcttc caagttatga 47940 cttgtgaggt atgttaatta tgataataga aggcagttta tttggttcag atttattgat 48000 gtgtaattta ccacagtaag acttcccctt tacaaaagta tgatgagttt tgacaaatgg 48060 atacacatgt gtatctacca ctgccatgct ccttttcagt ctgtcgtccc ctccacccat 48120 gaccactggt caccactgca gtgatttctg tccccttcat ttcacctttt ccagaatgtc 48180 atataaatgg aatcatgcag tatgtagttt tttgtgtctg gcttattttt cttagcatta 48240 ggcttttggg attcatccag gttgtcgcat gtaacagtag cttattcctt tttatggctg 48300 agtaagtgtc ccagttttat ttatatattt atttatgagg aggtgtctca ctctgtcacc 48360 caggctggag tgcggtagcg cgatctcagc tcactgcaac ctccgcctcc caggttcaag 48420 caattctcct gcctcctgag tagctgggat tacaggcacc caccgccacg cccaactaat 48480 ttttatattt ttagtagaga tggggtttca ccatgttggc caggctgatc tcaaactctt 48540 gacctcaggt gatccgccca cctctggctc ccaaagtgct aggattacag gcatgagcca 48600 ctgtgcccag ccccagtttt atttattcac cagttgatgg tcttttcgac aactaattgt 48660 ttccagtttt tggctattct gtataaggct tctataaata ttcacaaata cctaggatgg 48720 gatgactggg tcatataata gtactgtata accttagcag aaactgtcaa actattttcc 48780 aaagtggctc ttccatttta caattccaca gtgtattgag tcccagtgtc tccatacaca 48840 tgctagcact tttaatattt aatttagtgg gtatgtaatg atatctcatt gtggttttaa 48900 tttgcatttc tctgcagcta atgatgagtg tttctgctta tttgggaagg ttttaattta 48960 gcagtctgtt gtattctgta gatattaata acttcaaaat atcagtggca tttgcagtta 49020 aaatttcctt aaaaaattgg ccaaaggttt ccagcagtca cttctgccat gcccaaactg 49080 tatgaaacaa ggctgaggtg tggagattgt cacattttgg caaggagtga tccacttggg 49140 tgactgatga gacccagaga gcgtacgcct cgggcttgag ggtgaggacg ggcgggaagt 49200 cgactgcatg gccctgctgg ccttgggagg ctgcccagtc cttagctaaa gctggcagtt 49260 atgggaaaca gacttagatt ctattacgtt tttcaggatg tcccaggagt cacctgggaa 49320 gctcagcagt cctttgtgac tttcaagcat atggtagaag ctgctgaaca cagagctccc 49380 tctttgggga taatttgccc aaatcattta atcaggcttg agaaatgagt taccacaggt 49440 ccaggagtgc tgccaccctt gaattctgac accctatttc tcctatccgt ctcttaatta 49500 attaagcaga catccccaag tgcttacgac aagccaggac ccttttgcat actaaggaaa 49560 acagggatga aggaaacaga aatggtctct gctctgactc agaaggtaga aatcctcttt 49620 cccagccaag tcttcctagg gagcacgtag gaagggctct gaacccacgt gtcagttgca 49680 ggggaggata tcaggaaagg acattgaaga agtggagacc taagtttgag acctaggcat 49740 tagccaggct agcagtgctt gaaaaagtgt cttaggacaa gagaactcac cagtgaagtc 49800 ccagtggtag gagagcgtgc agcatattct gagcctgtat acacatctcc agggcattgc 49860 ttagcaggtg gggagtggca agagagtagg ctggagtcac agaagggagg ccaggtagac 49920 cttggtgagc actggactct atgttcaggt gctgaggagc tggcaaaagg ttttaagtcg 49980 gggagaggca tgttcagata tttggtctag ctgagtaact ttgggtgctc tgtgacaaat 50040 ggttgggaga ccagtgaggt ggcagttgcg gtcatctagg agcaggatca gagtggccta 50100 ttgactggga tgactgtgaa gtgggatcct ttccagccag taactggaaa tgtgtatgag 50160 ggcagaagtg agtgtactgc atttgaaaca ttgagaaatc tagtacatag tactgtctct 50220 tttatatctt tttttttttt ttttttgatt ttggtttgtt tgttcactaa cttggaaaac 50280 tgatgtggaa atgtcccttt ggcttcagtt acctgagcag aaggggccgg gcattgccaa 50340 actctcctct taggacagaa ttgctcccag tattgatcat tgtgttctga gttgggggag 50400 caaattgtgc aggaggccag gtcagtgcca aggtgggtgg gaggaattgg agcaggaagc 50460 ttgcctaagt gtgcccagca aagccacggt agaactttct actgtggctc tatgctactt 50520 cttagcaacc ttctccatgt gcttcctgga gagtccttgg agtcagaacc tttttcttga 50580 aacccagaca ctttacttcc aagaaaatgc tgtccaagaa aactcatcct tcccttcttc 50640 tcatgaacgt tgtgtagagg tgtgtcttct cttcctttga gcttttccac tcagggttta 50700 ggggaggtga tattctatat ttgggtttgg ctctgggtac tgcaacacta ggctattaag 50760 atttcatcct tactgctttg cccctcctat ctttccagaa acccacaatg gatttgctag 50820 aaataatgga acgtcctgtt tggacaggat ataaccattt ctcagctaga ggatattgtt 50880 ggaatgaaga aagataaatg gggagaaggg aactcacatt gctttggcac ttaaattaag 50940 ccatgtactg tgttgggaaa ttatttatat tatctcgttg aatccacagt agaacacagt 51000 tgaacaccat acaaggtaag tattgtcatc cttattttac catgaggaaa ttgatgctta 51060 gagagcataa agccttggcc aggggcacat agttgggaag ccggggctaa ttcatgcctg 51120 ggctctttct gatagttttc cttttttaat tgtcccctcc tcattgttac cttggggatt 51180 tcaagagatt catgtagctt ctaaatcaac gaactgattc ctggagagca gcttctgtat 51240 gagaaaaatc tagctaatta tttatttcag tgtctctgga atgcaagctc tgtcctgagc 51300 cacttagaaa acaatttggg atgacaagca tgtgtctcac aatgctgctc tggttgccag 51360 tgctgtgctg ccagttgtca tctttgaaca aactgatgca gtgctggttt aactcttcct 51420 ctttttggag taagaaactt tggaggcctg tgtccttcta gaagtttgct gagcaaatgg 51480 taaggaaaag aaataggtcc taaggcttga ctatttcaga gaatttcttg atttattgga 51540 ctgtcaatga atgaattgga atacatagtg gtaggctgtc ttttcttctc agacactgca 51600 atttcctcca atctcttgac ttttctagaa gttttaatcc aagtccttgt tgggtggtna 51660 gataaaaggg tattgttcta ctagagactg accttggcat ggagatctca tttggactca 51720 cagatttcta gtctagcgct tggttttgta tccatacctc gctactgcat tcttagttcc 51780 ttctgctcct tgttcctcat gcccagtgtc ccaccctacc cttgccccta ctcctctaga 51840 ggccacagtg attcactgag ccatttcata agcacagcta ggagagttca tggctaccaa 51900 gtgccagcag ggccgaattt tcacctgtgt gtcctccctt ccatttttca tcttctgccc 51960 cctccccagc tttaacttta atataactac ttgggactat tccagcatta aataagggta 52020 actgctggat gggtggctgg gatacacaga atgtagtatc ccttgttcac gagaagacct 52080 tcttgcccta gcatggcaaa cagtcctcca aggaggcacc tgtgacaccc aacggagtag 52140 gggggcggtg tgttcaggtg caggtggaac aaggccagaa gtgtgcatat gtgctgacca 52200 tgggagcttg tttgtcggtt tcacagttga tgccctgagc ctgccatagc agacttgttt 52260 ctccatggga tgctgttttc tttccagaga cacagcgcta gggttgtcct cattacctga 52320 gagccaggtg tcggtagcat tttcttggtg tttactcaca ctcatctaag gcacgttgtg 52380 gttttccaga ttaggaaact gctttattga tggtgctttt tttttttttt tttgagacag 52440 agtctcgctc tgtcgccatg ctggagtgta gtggcacaat cttggctcac tgcacctccg 52500 cctgccaggt tcagcgattc tcctgcctca gcctcccaag tagctgggac tacaggtgcc 52560 tgccaccatg cccagctaat ttttgtattt ttagtagaga cggggtttca ccgtattggc 52620 taggatggtc tcgatttctt gacctcgtga tccgcctgcc tcggcctccc aaagtgctgg 52680 gattataggc ttgagccacc acgcctggcc gatggtgctt tttatcattt gaaggactca 52740 gttgtataac ccactgaaaa ttagtatgta aggaagttca gggaatagta taagtcactc 52800 caggcttgag gcaaaattta caaatgctgc tgactttgta tgtaagggga ggcattttct 52860 tagaanaagn agaggtaggt ctctgggatt ccagtatgcc atttccatcc tcagtgtttt 52920 tggccacctg agagaggtct attttcagaa atgcattctt cattcccaga tgataacatc 52980 tatagaacta aaatgattag gaccataaca cgtagctcct agcctgctgt cggaacacct 53040 cccgagtccc tctttgtggg tgaacccaga ggctgggagc tggtgactca tgatccattg 53100 agaagcagtc atgatgcaga gctgtgtgtt ggaggtctca gctgagaggg ctggattagc 53160 agtcctcatt ggtgtatggc tttgcagcaa taactgatgg ctgtttcccc tcctgcttta 53220 tctttcagtt aatgaccagc cacnggcgtc cctgctgtga gctctggccg ctgccttcca 53280 gggctcccga gccacacgct gggggtgctg gctgagggaa catggcttgt tggcctcagc 53340 tgaggttgct gctgtggaag aacctcactt tcagaagaag acaaacagta agcttgggtt 53400 tttcagcagc ggggggttct ctcatttttt ctttgtggtt ttgagttggg gattggagga 53460 gggagggagg gaaggaagct gtgttggttt tcacacaggg attgatggaa tctggctctt 53520 atggacacag ractgtgtgg tccggatatg gcatgtggct tatcatagag ggcagatttg 53580 cagccaggta gaaatagtag ctttggtttg tgctactgcc caggcatgag ttctgatccc 53640 taggacctgg ctccgaatcg cccctgagca ccccactttt tccttttgct gcagccctgg 53700 gagccacctg gctctccaaa agcccctaat gggcccctgt atttctggaa gctgtgggtg 53760 aagtgagtta gtggccccac tcttagagat caatactggg tatcttggtg tcaatctgga 53820 ttctttcctt caggcctgga ggaatataat aactgagact tgttttattt ctgcagaggg 53880 ttctaagcca ttcacttccc agatgggcca ataatgcttt gagtaatctg gagatcatct 53940 ttaatgcgca ggtgaatgga actcttccac agagggatgt gagggctgta gagcagagtg 54000 aactncctga aactcagacg tcagctcttt gtctctctat ctctgaacac ccttccttag 54060 agatcccatc tctaggatgc atttctctgt agttagtttc taagtctctt gttcctgttc 54120 tgcctttatt tttttttcct ggattctaag ccagtatccc cacttggctg tcttaatgta 54180 gcttaacatg tctgtaatca aaatgatcat ctttctgaga ttcaaagggc tataagggac 54240 tttggagaga atttcattca gttttcctca aactagaata atgcttgcac tgtctgtaaa 54300 agaacaaaag tgtcaaagca tccttttgtt cactaaattt ccttttttat tatagtgtta 54360 cttaaatatt aggaagtnaa aagtaggtat aaacttcntt ataggctgtt attatacaac 54420 tatatgaccc atacatattt acaaattaag tgcagccaaa attgcaaaat caataccatt 54480 caaattaata ccttaaatgt ggtgaggcag ctgttgttca actgaaacca aattataagt 54540 tgcatggcag taaatgctat catgctgatc attttgagtt tggccagtct atatntatca 54600 tgtgctaatg attgaattct ccacccattt ttctacttgt atgaccttaa tttgatggca 54660 cctgttccat cctcatgagt ttgctacaat tatactggtg ccaacacaat cataaacaca 54720 aatataaact tgggctttga aatcttgtgc cagaacttgg ctttaaagta agcatttaaa 54780 aaatccatat gtgtttatta gactttgttt agatgactgt tgaaatgaaa acaaagtgtt 54840 taaaatcctc ttagagaact taaatataat ccctcagcaa tatgtataca gatcttcctt 54900 tgagaaaaac tgattgtgtt cagcctctca tgttacaaat ggggaacctg aattctgagg 54960 tctctagtga gagaacaggg actggaatct gtggantcct atctgtttta ataataattg 55020 taaagtataa tagataatat tatattaata aaataaaagc aaacacttag aatgagcttc 55080 catgtgtgag gcactaactg attaggcatt attaactaga tttattcctt ttaaggcccc 55140 gcgatgtact gttatttcca catgttgtag ctggggaacg tgctactcag agaggttaag 55200 taacttgtct gaggtccaca ccactaacaa ggagcacagg tagggttcaa atccagataa 55260 tctgactttg gagctggcac tctaactcaa tgtgcctaat cgcttttcag tggtgtcatt 55320 attttgccta ttctccatct gagaatattg aagtttctga ctccttcctt gcctttctcc 55380 ctgcctcccg tggttatccc caggtcttgg tgttccagtc ctctatgtcc gtccttactc 55440 ttattccttt gctacagtgt gatccagggc tcctgcccct tcttatcctg gtagaggggg 55500 cccacttgct gggaaattgt ctccgccatg gtttatccat gttgtgtgtc cattagtgag 55560 tagtgggaag aatcatatca tgttggcaat gaaagggggg ctatggctct ggggtagtct 55620 agtctgaacc tcttatttta cggatgagaa agctgaggta caaagcaggg aagggatttc 55680 ttgaggtcac ccagccagca actgagctgc aaccagaagc tgagatcccc aggactaggg 55740 ccgagcctca ttctgtccca tcacagtgac ttttcttccc tcctccaaac tatttttatt 55800 ttttattttt ttgcagctgc ttagcagctt gaagttagaa gaaagggcag ggaaaaggtt 55860 ttccgtgctt agccagggaa ggaatcctgc aacaggatgt ggggttgggt cattcaaatt 55920 gggccagact ccactggtct tgttgcttct tgcttggtat tgcagatggg tttaaaagtg 55980 ttaggattag agagataggc aggtttagcc aaaggcagtt tgtagccttg tggcagagtt 56040 ctttttaaag aaggaagtgg gatgcaacac cctgacacaa aggggcttaa gttgttatac 56100 cactgcctgc taacctgttt tccttaactc tcttcctgat ttctaaagga agtatatttt 56160 gctgaatcag aaagaaaagt gatttatttc aggttgctga tgcttagatt gttagagttg 56220 gaaagatctg gcttgcatct tgtacagctg acagaactgg ggctcagggg ggcacaggtg 56280 cccagagttg gtcagtcagg aaagtagcac cagaaccagt ctcctggtgg ccctacagtt 56340 gcagaccctt ttttgctttg ctctctgtgt atactaaagc ttctatgtct ctgaatctca 56400 agttctgact ggtagctact ttccaatcca cctggcttag atttctagat tatattgttt 56460 agacgtcaga acctcttaag ggttttgggg ccacttgtta gctcacatag tgagaaccag 56520 ccctgcccat taggtagggg aagaagttag cagtccatga tagctgttgc ctgcagcgta 56580 tggatgttca ttgcacagtt cctgtctcct gagatcctgg agtgtatacg cttggcctca 56640 gagcccagca cagagcctgg cccttgggac atgcttagta agtatttact gaatgagtgg 56700 gaaatgtctt aaggcccatt agtttgcagg tcttgaggag gctcccttgc actaggaaga 56760 atagaaagca tacataaagc ctgtgtgctg ccgccaggaa gactagaaac gctatgttca 56820 gcctggagct gaatggtata ccccagagca accctgttga aaggcagtgc ttgccttttc 56880 attctgtgtc ctggtttgct ggtaactcct gggtcccctg cctctcctgt acccccattg 56940 tgcagactga ggggggacca tcagccaggg ttagttttcc gctgtttctg ttaggcaaag 57000 aataaattga attgagttgt gaaagttggg tgcaaagctc agtttgggtc caaagtaaca 57060 gttaacttgt gtgggtggca ggtattcagt acaaacaggg ctggggacag gaaggggaag 57120 agaacttcag agctttcacg atcctcatct ggttttaggc tgatccagag gccaaggtcc 57180 ccatggaaca aactggacaa agtgagggtg gccacatggc ctcttttctt ttgcctttat 57240 tattaatttt ctcaaataga tctgactagt catgtggctg ggaaaatagt taattgtgat 57300 tttttttttt ttaaactgag tctcactcta ttgcccaggc tggagtgcag tggtatgatc 57360 tcagctcgcc gcaacctctg cctcccggga tcaagcaatt gtcatgcctc agcctcccgg 57420 gtagctggga ttatgggcac acagcaccac gcctggctaa tttttgtatt tttagtagag 57480 acatggtttt agcatgttgg ccaggctggt cttgaactcc tgacctcaag tgatccaccc 57540 acctcagcct tccaatctgc tgggattaca ggcatgagcc actgcaccca gccagagtac 57600 cactatttgg gcattcttta atgaaaaaga atgaactatc caaaaattaa aactcctcat 57660 ttatgagctt ttagagaatt ttacagagta gatggaaact ctctgcatcc tttccccact 57720 tctagtttca cctgacacat ttcttccctg tccttactcc tgggccggca gcagtggtca 57780 tgattccaat cccagcttgg ccaccatctg cctcagtggc ctaggaaaac tcctttctcc 57840 agagctttag ttttctcttc tacggaatga agaaagttaa aacaaataga catttattgt 57900 ttcatttgga taaatatcta ttaagcatct attacttgtg gtatggttag ctgggtatat 57960 agtggtgaag cagctgggca tgagtactgc tttcgtagag cttacagttc agtgaggcca 58020 gcagatgtga aacatatcat cacacaaata aaaatataac tatcaactgt gatgaggatt 58080 atgaaggaaa aaatccggca aactatggta ctggtgttag atactagcag gtgtgggtag 58140 ggatttcatt tagattgaca ggttgtcaca ttaaagctga gagccctgaa gttcaagcaa 58200 tggttagcca ggcaaagatc agaggcttag agatagggaa atccattcca ggcagagaga 58260 ctgggggtgc ctgtccccta ggtcagggaa cagaagaaag ccagtggcac tggtggagtg 58320 aataagactg gcgggggatg agttggtagt agacatgacc agatmattta gggcncaatt 58380 ctcnctgggg naaggagaat tntaatttaa tatttattta tttatttatt tatttattta 58440 tttatttatt tatttatttt tcaagacgga gtctagttct gtcgcccagg ctggagtgca 58500 gtggagcaat ctcggctcac tgcaactttt gcctcctggt ttcaagcgat tctncctgcc 58560 tcagncctcc tagtagctgg gattacagac gcccaccacc atgcccagct aantttttgt 58620 atttttatag agatgcggtt tcaccatatt ggccaggctg gtctgaaact cctgaccttg 58680 tgatcctcct acctcggctt cccaaagtgc tgggattaca ggcgtgaccc acagtgcccc 58740 ctgagaattt aattttattt tatgtgcaag aggattccct gaggtagtca ggccacattg 58800 tctggtgact cttgggatag agggaacttg aatgacaaag gcccaagaaa gcaattgtaa 58860 tcattacata tacatggacc attttatgct gttttcttct ttcatttaac attatttagt 58920 gtgcgtgttc acatatttct aaatcatctt ctgatttaga ataatgattt ctgatgtgta 58980 ggctgtgttt tatagttttg aaagtaatac tttgatatcc attacttntc ttgattctca 59040 cagcaattct gnaggtgtat gcgttgcaat ttctgtttca cagatgaaga gagtattgtt 59100 aataagttaa tggccgggca tggtggctca cacctataat tccagcactt tgggagacca 59160 aggtgggcgg atcacttgag gccaggaatt tgagaccagc ctggtcaatg tggtgacacc 59220 catctctact aaaaatacaa aaattagcca ggcgtggtag cacttgcctg taatcccagc 59280 tatttgggag ggtgaggcag gagaatttgc ttgaacctgg caggtggaag ttgcagtgag 59340 ccaagattgc accactgtac tcctgcctgg gtgacagagc gagactctgt ctcaaaaaat 59400 aaaaagttgc taagaggagg gctgggatct tttggctcca aatctactgt gggatgatgc 59460 ctttgacatt cctgatagct gtgcagtaat ccattaacac agtttttata agttcaaanc 59520 cctgttgcca acatttagat tgttccatgt gtgctgttac aaataaatta ctataaagat 59580 tctatacatt taatctttta ttatttttgt attatttctg taggccaaaa tctgaggaac 59640 aggattacta ggttgaaggg aaatggccct tgaagtgtct gatcagatgt ctttccagag 59700 gatccaacca atttaaatag ccaccatcaa tgcatgagac tttgtagttc agggaaggca 59760 ggcctggttt taaaaatcat ttcccctctc tagcattttt ctgatgtgat ccttaagatt 59820 tcactttagt tttcccaggt ctcattggca tgtatgctgt tagggatggg tctaaaatta 59880 atttttcttc acattcatat catgtcatcc cagtgattat ttaataaata atcacttgat 59940 taaatagtga ttccttttct agttattttt gggacattta ttaaaacctg gatatggtgg 60000 ctcatgcctg tattcccagc actttgggag gctgaggtgg ggggattgct tgagactagg 60060 agttcaacac cagcctgggc agcatagcaa gactccatct ctataaaaat aaggaaatta 60120 gtcaggcatg gtnggtactt gcctggagtc ccagctactt ggaaggctga ggtaggagaa 60180 ttgcttgagt ccaggtggtc aaggctgcag tgagctatga ccatactact gtactccagc 60240 ctgggcaaca gagtgaaact ctgtctgaaa aaaaaaaaaa aaaaaaaaaa aaannnnnnn 60300 aaaaaaaaga tgtgtaggga gcaattttgg agttattcat ttggtcattt gatatgtagt 60360 tttagttttg gtgctgatag agcccagaat gtaccctgaa tttgatgaac attctgatat 60420 atgggggagc tcattgtccc ccacttacct ttttgcctct cagaatatct tttgatattt 60480 ttatctgttt tttccccatt gaatgttatt accttatcaa gctcaaaaaa gtaccctatc 60540 gctattttaa gttcagttgt gttaaatcta taaattagct tgggaaattt ggatattaaa 60600 tgaactcatg aagaagcaga gtttagctct ccttaattct catcttcctt tatttatcct 60660 actacagttc tgtggttttc ttttatgtaa gaagcacatg ntttggctaa gttaatgcct 60720 aggttttttt gtttatgtgt ccattctcac tgtggatagt attctctttt ccccacatta 60780 tattaattta actggttttc agagactaat agcaatgcta ttatttagga gaatttacct 60840 tggttctgat taacttaccc atacttgcaa atcatttgca gctttttagt taactttgtg 60900 agttctctta gatttacgac catgccagaa acagaaagga tattttcatc tcttcctttc 60960 tgatgtttat tcttcttgtt tccttttttt atcccccatt atattctcaa gaatctctca 61020 atactaagaa atagcgactt catttttcag cgncggagtg cattattttg gctaccatga 61080 ttcagaagcc tcttgcctaa ggcccaattt tattctgcta gttttctctg ttctttgtac 61140 atggcccttg cgctgcccta accttgaatt aacgtggcta aatctcaaga atttaagagc 61200 accgtgactg tgtcctcagg ctagggaggg aaatgggttc acagagtgac tggattgtgg 61260 tctatgaact tcggcagcca gcagcaaaag tcaggcatga ataatcaagt ggacagtgaa 61320 catctgtagt gtgggagatg ttggcataac tatgaatgat gattcaagag tggtttgatg 61380 catattgaat aacatgatga taagtactag actctgtgct aagccttcta tgtgaaatac 61440 atttaattct cataataact ctagagcagt ggttctcgac cggggccggt tatcccccta 61500 ccccacccca ccctcaccct tccaccaggg acataacatc tggagatatt tttggttgtc 61560 acaatcctgg gaatgtatgt gctgatattt agaggttgag gtcagggatg ctgctgaact 61620 tcgtagaatt cataggagag tctctcacaa cacctatctg gccccaaatg tcagtagggt 61680 cactatcaag aaaatctgct ctagcagtgc ctgctcatat tatccccatg ttgaaatagc 61740 aagatgggaa gtgcaaagtg gtgcttcggt actcttggag cagctttgac tttggtgaga 61800 aacgcctttt aaaaacaatg tttcttccca tcttcccacc ccatggggag gtgtggggtt 61860 gggtgggtag gcaccaaagc aagatttaga agagttttct gtaggaattt ataatggtaa 61920 aggatcaact tcatttccaa gctatttatg agggtttatg tttaggaaaa gtgctaagct 61980 tagagaagga ggagaaatct gattttatta atgagtgtag ccataatggc atatcctggc 62040 agaagtcaac tttggtttct agagggaggc tattatgaaa agaaatacct ggaacattcc 62100 cctgggtttg gaaggtgagt tctaggttca atgatgggaa gaattttaga ggtccaagat 62160 aaaagggcaa agattaaatt ttgtctctca tgagttctct ggctcaggtg gtgtgaactt 62220 tgcagacagt ctctttaatt cactcataca tgctagtctc ccagctcagc aagggctttg 62280 agagagcagg tgtctgtatg ctctggtaag tgaaggcaaa gtgcataagg aggttggggt 62340 ccataatggc gaagagaagg agcccttcag tcagagtggc tttgaatctt ggctctgcca 62400 tttgccaatc ttggaccatt gggcagtgta ttaactcttt gaatctcagc ttcctcttct 62460 gtaaaatgtg tataacaaga gtactaattg gattgtttga tgattaaatg agttaatgtg 62520 tataaagcac tcacaaccct ggtacatagt aagacctttc attattatta tcatcatcaa 62580 ttttttttaa cctcttttcc tgatctgctt acactcacca gcttcagctg ctccaaatgg 62640 cttgtaagat tttttgtttg ccctttgctg tcagttgcca tggggaagat ccattcattt 62700 ttttcagtca accaacatat tttgagcatc tgctgcccta caggatccta gatatggggg 62760 ctgcagagat atccaggaac ataagccttg attaattggg tcagatcagt gctcagcagg 62820 gctggcaagt gctaggtttc ttttaagtgg catatcttaa aaggtatatg tcctnaaaca 62880 tagctttgtg atggcagcat gatgggtaca aaagcacaca cttaagtgtc agtagatctg 62940 ggttcaaaca ttggtgcagt ttcttatggc tcgtaacttg ttcaaacctc agtttcttca 63000 cttctaaaac ggtaatgata caacctacct cacagggtta ttatgaatta aatactggag 63060 atgagataca caaaacgtct tgagntacac agtagctgcc caatattggc tgtaagtatt 63120 ataaatctac aagctgtgaa ttaattttac ctctctggat cctgnttgat atttctagac 63180 cattccacct agtggggcca tttcctacct gagtcacccg tggtgtcaaa tagaatgtca 63240 tgtggcctcc tgagttgggt agaattggct gctcatctca accccgctac tgactatctc 63300 tgtgatttac ccttcctcca gccttagcct tgctacatat aaaatcaaga caataatgtt 63360 tcctatctca cagggttgtc ctgaggatta aattaagtaa ttaatataaa atgtgccttg 63420 tacatattgg gccctaaata aacagtagct actatttatc cttaaagtac aaatggtagt 63480 ttcagagctt caaggctgat ggctatttat cttactcata ctctttgttt agcttcattt 63540 ttttccccta atttcattag wattttcttt tctctttttt tttttttttt tttttttttt 63600 ttttgaggtg aagtctcact ctgttgccca ggctggagtg caatggagcg atcttggctc 63660 accccaacct ctgtctcctg ggttcaaaca gttctcctgc ctcagcctcc cgagtagctg 63720 ggattacagg ctcccgccac catgcccagc tatttttttg tattttcagt agagatgggg 63780 tttcaccctt ttgaccaggc tggtcttgaa ctcctgacct catgatcaac ccacctcagc 63840 ctcccaaagt gctgggatta caggtgtgag ccaccacgcc cggcctcata agtattttct 63900 aaatttattt acagtcatgc catttaaaag gaaagttgta ttcctgtctt tgttaatatt 63960 tataagtgat tttattcagc tacaagcttg gaatggcata taattttgta ttctgctttt 64020 ttcacttaat attacatggc taatgatttc tgtgtttcat aaacattatt ctgatgatgg 64080 catgatatat tgttgagtac atgtaccata attgaatcat ttccctattg ctatgcaatt 64140 aagttgtttc caatattttg caattataat gtttcaatga atgaataact ttatgcatat 64200 agctttttga tatcttaagt tcagtttcct aggatgaatt tccaggaata gtaattgggc 64260 aaatgggata aacatgactc ttgaatacgt attgttaaca ttgctttccc aaagggctca 64320 actgatttat atttccgtgt tcattatctt ttaaaccagc tcatttactc accaaacatt 64380 tttaaagcca ttatcatgtg gtaggcttag taagaagaaa gtgaccctaa gggagaagct 64440 tatatataaa tagggtccct ggtgtaccaa gtgctgatac agacacaaag tacctgggga 64500 aattgagatg agggagtcct ggctcagctg ggagaaaagt tcattttcat agagtcatgg 64560 ttttgttctt tggcagaaag aaaattgctt tcttccccac ccccaccccc agctttattg 64620 aggtataatt gacaaataaa aattgtatat ctttaagata tgcaatgtga tatatatgta 64680 tatctcaact taaaaaataa gctacagaat aaaaaggtgt ttgctattaa aaaaaaagaa 64740 aaggctgaat gtcattccca agcttggaaa tttgagtatg ttgcctcttt gggattattt 64800 acagaaatat tagcaagacc agccccatct ttggtcttga gtactccact gtcagcatgc 64860 tttcttccag agagggatcc atttgccttt atttttcatt ctgttgtgcc gtctatgcaa 64920 actattcttg atagttttat ggtaacagtg tttttttgtt ccatgagatn aatttataca 64980 tgctcattgt ggaaaattta gaaaagacag gaaagtatta aaanacatca cttttttttt 65040 tttttttttt tttttttttt taagnagaca gagtcttgct ctgtcgccca ggccggagtg 65100 cagtggcgtg atctcagctc acagcaacct ccgcttccca ggtttaagtg attctcctgc 65160 ctcagcctcc caagtagctg ggagtacagg catgcaccac cacgcccggc taattttgta 65220 tttttagtag agatggggtt tcaccatgtt ggccaggctg gtctcaaact cctgacctca 65280 ggtgatccgc ctgccttggc ctcgcaaagt tctgggatta taggcaggag ccactgcgcc 65340 agccacacct acgttcttat catcctagta catccactgt cattatcttg ctgtatttcc 65400 ttctgcccag tctcactctg atcatgcagt ggcgtgatca tgcagtgatc tcggctcact 65460 gcaacctagg ccttctgggt tcgagtgatt ctcctgcctt agcctcctgg gttcaagtga 65520 ttctcttgcc ttggcctccc aagtagctgg gattacaggc atacaccccc atgcccatct 65580 aatttttgta tttttagtag acacagcgtt tcactaaaat tttgtatttt tagtagagat 65640 ggggtttcac catgttggcc aggctggtct ccaactcctg acctcaggtg atccgcctgc 65700 cttggcctca caaagtgatt acaggcatga gccactgcat ccatcgccaa aaagattttt 65760 taaaagagtt taatgtagaa ccatatcaaa ggtctttgga aataaaaaac agttttttaa 65820 aaatatcaga aataaaacaa caaataaata aataaataaa aacacccaaa acaatctgaa 65880 gcacgagcac ctagcagaaa ggttcaatta tgatctattc atagagtgga atatcaagta 65940 gacattacag gacatgtttt aagattatat tttatgtcat gggaaatgct ctcccagtat 66000 gatgttaaat gaaaaaacag aatacaaaag tatatatgct gcatagtctc aatattgtag 66060 agaaaaaata ttatttatgt atgcatgaaa aaagacaaaa gatgttaaca gagatccatt 66120 gttacttcag tttactaggg attgtctctg ggaggtagga ttaaggtgat ttatatttac 66180 ctttttaaac ttttctgtat ttttttattt tcaaattttc cataaaaata taaggacttg 66240 aagatcaaga aaaaatttct gctttggctc agtgcagtgg ctcacgcctg taatcccagc 66300 agtttgggag ccctagggga gaggatcact tgaacccaag agtttgacgt tccagtgagc 66360 tatgatctcc ggatcgtacc gcctggacga tggagcaaga ccctgtctca aaaaaaaaaa 66420 tctttgcttt ttttttttgt ttgtttttga gacggagtct ctctctgttg ccccagctgg 66480 agtacagtgg cacaatctca gctcaccgca acctctgcct cctgggttca agcgattctc 66540 ttgcctcagc ctcccaagta cctgggattc catgcaccca ccactatgcc cagctacttt 66600 tttgtatttt cagtagagac agggtttcac catgttggcc aggctggtct cgaattcctg 66660 acctcagctg atccaccggc cttggcctcc caaagtgctg ggattacagg catgagccac 66720 tgtgcccagc ccaatctttt gcttttttta aaaaaagaag acaaaaaggg attttatacc 66780 agtattatct tggctgtgtg actctgaagc cacagttgta agttataatt actctgaaac 66840 acaaggccct gtgactcttt tgggctcttt ggtgtttatc ttgattacaa cgttggaata 66900 tagaaatgaa aggaatggga gaggtgatag acttcaggca gtgtaactag ttgtctgaac 66960 actactggct caattatatt gtgtctagtg atttccatct tgtccgtctg ctaatttatc 67020 gcctggtaac tcactgaggc agggttttcc tttggagaaa cctcattgtt ttaaccagtg 67080 tatcatgctt gtttagaagt tcaatgatct ttttaactca tcggagaaga tgatgaccag 67140 acctggacag atggggaagg actttgcact ctctctttac agtcctgagt gcacacaggt 67200 caatatggaa ctatgtgtga attttcattg tctttgagag ccctcttctc tgccccatag 67260 ggagcagctt tgtgtgcaat tagaggagca agggttgtgt gtatttagca cagcaggttg 67320 gcctggtcct ctcctctcaa catagtcacc acatacctgg cactatgcta aggctgggaa 67380 tgcagacaga tgggtgcctg ctttcagagt gctcaatgtg ctgaggaagc cagcaacaga 67440 aacagatgat ttcaggagct ccaggaaaat gctacaggag gagtgtgcct gggttactgg 67500 agtagcacag gaggagggct tctagctcag gctgagattt tagtaaagga aattatgcca 67560 cgatgaatcc tgaagaatga atagaagtga accagataaa gcacgatagg aagcatcttc 67620 ccttacctaa gggaagacac agaggtatat ggaatggtat gttaaaaggt tgggactcca 67680 aacagttctg ttaaagctta gagagtggtg ggagagactg gagaagttga ttaattagta 67740 aatgaagttg tctgtggatt tcccagatcc cagtggcatt ggatatccat attattttta 67800 aatttacagt gttctatctt atttcccact cagtgtcagc tgctgctgga agtggcctgg 67860 cctctattta tcttcctgat cctgatctct gttcggctga gctacccacc ctatgaacaa 67920 catgaatgta agtaactgtg gatgttgcct gagactcacc aatggcaggg aaaatccagg 67980 caattaacgt gggctaaatt ggacttttcc aaagatgctg tctttgggaa acatcacaca 68040 tgctttggat cagnaaaacc taggcttcta atttgttgat aaggcatgaa ctcaggagac 68100 tgttttcagt cctagtgaat ggtgataatt gtaattataa cagtagacaa catctctttt 68160 acacatttta aatcatgaaa atagaataac cttactgata attttagaaa gtggtgatta 68220 aaagcacatt taagataatg ccttaacacc tagtcttttc catatgcatg atgtcttaat 68280 cacacattgc aaatcatgga acacagaatt ttaagcagca tttgtgtaga acttctcagt 68340 tttactaata ttattttatt ttattctcat aacaaccttg aatagaactc agatcatctg 68400 tcaatcatgt attttgataa cagcctttac agtgagcata gaaaatacag tagtggctaa 68460 caacacaggc tccagatgtc aggttatctg ggtatgaatt ctggtgtcag cattcactaa 68520 gcatatgacc ttggacaagt gatttaagtt tcttttaaac agagaatagt aatacctacc 68580 tcatattatt attgtcagtg tatcatctta caatcacagt ctttctctta gggctgggct 68640 cagtgggtgg attgacactg cagaaatggc cagatctaaa ggatcaacat ttacgtagct 68700 gggaaatgta gctgggactt cagtttcact gccctagtga tttttcctac cactaagcag 68760 ctcagtccat acccctacga gacccacaag cttatgagat actgttcttc caggaaagca 68820 gtggggccag ggccaccttt taattgtgtt tcttggcctg gtcccatctt tctcacaata 68880 tatagcaaca gttatttact tgctgatttt ctaatgcaca tcacacatag tcatattaaa 68940 cacacacaca cacacacaca cacacacaca cccctcaaga aacattttct gagacgtgat 69000 ttcctgattt catcaaaaaa gaaaagagcg ggccaggcac agtgggaagt caaggtgggt 69060 ggatcacttg aggtcaggag tttgaaacca gcctggccaa cacggtggaa cctcgtctct 69120 actaaaaata caaaaattag ccaggcgtgg tggcgcacac ctgtaatccc agctactggg 69180 gaggctgagg caggagaatt gcttcaacct gcgaggctga ggttgcagtg agccgagatt 69240 gcgccattgc actccagcct gggcaacaga gtgagactct gtctcaaaaa aaaaaaaaaa 69300 aaaaaaagca taaactgaaa tttatatgca atttatatgc ctgtgagata attctgtttt 69360 ctcttttgga accccaaaga gatttttttg attgatgagc aaatacattt tagattttat 69420 ttaagcatta tgccaagcac cactgaagta taagtttcaa gggcaaactc agttttttca 69480 tctactagac gaatgatttt ctggaatgat tacaagcagg caangatggt gnntagtgga 69540 aatagcaaat gtcttcggca tcagacaagt tggggtttgt ttgtatcctg cctctgccct 69600 tcaccgaggt tgtgatcttg ggcagattgt tgagttttaa cctagattcc tctgactcca 69660 gatcataaat tttcagaaaa gttctgaaat tcttgtatat actgatggta aatgagactt 69720 ttccttacat ctatgcactt ctttgtttgt ttgttttgag atggtcttgc tctgttgccc 69780 agactggagt gcagtagtgc aatctccgct cactacaatg tctgcctccc aggttccagt 69840 gagcctcctg cctcagcctc ccaaatagct gagactacag gcatgtgcca ccacgtccgg 69900 ctaatttttg tatttttagt agagacaggg ttttgccatg ttgaccacac tggtctcgaa 69960 ctcctggcct caggtgattc gcccgcctca gcctcccaaa gtgctgggat tacaggcatg 70020 agccaccatg cccggccata tccatgcact tcttgcaacc ttaccttctt ttctcatcac 70080 cctccaggga cctagttgga agagcagagt taaaagttaa ggtgaaactt ggagaggtgt 70140 cttgtcccta ggaacaaagg actggtttga aattctctgt aaatcttccc cagttcaaac 70200 cagagttatc aaggtcttaa aaacttccct gggtcctgag agcccattat attatttact 70260 tgtcttcctg tacacccact gcctagtcct gatcctactt ttgtttgcaa ataggatggg 70320 gcacaacgta caaggaaggg cctttgccac ccctgctaag ggataacctg aaataccttc 70380 accatcactg ccctgtgctg cttttcacct atgccagtct gtctacagtg ccagtgtctc 70440 ctggcattga aaggggagaa tcttttggtc ctttgagtat ttggttgggt tacataaatc 70500 tccctgaatg aagagcagct gacttaggca aggggccttg tttggttttc cttgaactat 70560 taacaggaag atagggagat taactgtgta aatgttcaat aggccagagt ccctgcagag 70620 ggtggccaca gtgatcagat cttatcacat ccttgctttg ggtgttgcct ctctggttgg 70680 agtatggata gaaaagaaag aaagacccta tattgaaatg caaagtgcag caagtcctga 70740 ctttggatta acttctcagc ccatttgcat gaaaataaaa agatgaataa aacaaggttc 70800 ccactttgga gggaggtggt agctgtgaga tggaaggagt gttcctgctg ggcaacagca 70860 gagtaagtgc tggggtagat tcactcccac agtgcctgga aaatcctcat aggctcattt 70920 gttgagtctt tgtcctacac caggcactct gcaaaaacgc tttgcctgca aggtctcatg 70980 cgatgctcac cacagctctg tgaagttaat tgtactttta tcaccatttt acagatgaga 71040 aaactgaggg tatggggtca atgacttggc taaagtcact gcttagcaag ctgcagggac 71100 tggatgtgaa ttccaattgg tttgactcca aagcctgtga agctacttgt tcttcaccac 71160 ctagagctgt ggttcttgat aactgtgaac tcttttgggg tcacaaatag ccctgagaat 71220 atgatagaag caggagctct ggcctttctg tccatacctg aacaggtcct tgggttaaga 71280 gcccctcgtc cagggcctat taatcttgat cctcataagc agcatccatg tattacggcc 71340 gcaaaccaaa ctgtgccaga ccgaatccta ggaccaagcc caaatatgtc ccatcatcct 71400 tttggtaaga agctcattgt aagaaagaaa gaggagagca agaggatgac ctagtgcatg 71460 gggcctcatt gttttaatta gtgacaaaac aacaataata acaacaaaac ccccgaagct 71520 tcacagatga catcagaccc caagcctgtg tgtttttcag gtgcccttga ggagctttgt 71580 agctggcaga ggaggtgaaa ctgacaaatg tttggcagat ggaggagagt accagagggg 71640 tttgagatga gctaaattcc aatctaaccg caggtgtgag gaagaggctt ggattgggac 71700 catggagatg ggggttctac tcccagtcac gccagctgac tttgcgagtg ttctttgtca 71760 gtcactttat cttattttat ttatttttat ttttttgaaa tggagtttcg ctcttgtcgc 71820 ccaggctgga gtgaaatggc gcgatcttgg ctcactgcaa cctccccctc ctgagttcaa 71880 gcgattctcc tgcctcagcc tccagagtac ctgggattac aggcgcctgc caccaagccc 71940 atcgaatttt tgtatgctta gtagagacag ggtttcgcca tgttggccag ggtggtcttg 72000 aactcctgac ctcaggtgat ccgcccacct tggcctccca aagtgctggg attacaggcg 72060 cgagccactg tgcccagccc acttcatctt accgtagtta cctccttaga gtatgaaaaa 72120 ataggcttag ggcatcccca agtcccctct atgtctgaga gctgaggctg gctgtcaaag 72180 aggaactaag gatgccaggg actttctgct taggacccct ctcatcactt ctccaacgct 72240 ggtatcatga accccattct acagatgatg tccactagat taagaatggc atgtgaggcc 72300 aagtttccac ctgagagtca gttttattca gaagagacag gtctctggga tgtggggaat 72360 gggacggaca gacttggcat gaagcattgt ataaatggag cctcaaaatc gcttcaggga 72420 attaatgttt ctccctgtgt ttttctactc ctcgatttca acaggccatt ttccaaataa 72480 agccatgccc tctgcaggaa cacttccttg ggttcagggg attatctgta atgccaacaa 72540 cccctgtttc cgttacccga ctcctgggga ggctcccgga gttgttggaa actttaacaa 72600 atccatgtaa gtatcagatc aggttttctt tccaaacttg tcagttaatc cttttccttc 72660 ctttcttgtc ctctggagaa ttttgaatgg ctggatttaa gtgaagttgt ttttgtaaat 72720 gcttgtgtga tagagtctgc agaatgaggg aagggagaat tttggagaat ttggggtatt 72780 tggggtatcc atcacctcga gtatttatca tttctgtatg ttgtgaacat ttcaagtcct 72840 gtctgctagc tattttggaa tatactatat gttgttaatg atatcatgca gcagacgtgc 72900 atctgaatgg gctggctcta ggagctagag ggtaggggct ggcacaaaga tgcatgctgg 72960 aagggtcctt gcccataaga agctgacagc caaggctagg ggagttctgt cttctctgca 73020 tcaggtcacc tctctcacct ctgtcactgc cccatcagac tacaatgtct gcaggtcttt 73080 ctcccctgag tgtgagctcc ctgagcaaag caggatgctg ccccttccct ttgtattcct 73140 ggctcctggc ttcagtgcct ggacataagt atgggcataa taagtgtccc ccaaatgaga 73200 cattgaggat tcttcaaatg cacaggaccg tgatgtgagt taggacggag taaggacgat 73260 gggatgtggc tcaggacaat cctgaggaag ctgcagctgc ggcacgcagg gccacactgt 73320 catgttcatg gaccctagac tggctttgta gcctccatgg gccccttcca tacacaaata 73380 ttaaaaatta tatttcatga ctgncattgg tataaagatg aatataatcc agaccagatt 73440 catgattatt catacatttt tagtgtatta acttttaatt ctgcttttaa aataaattaa 73500 aacattctaa tatgccctta agagtatccc aggcccaggc cactgagcct actgtggttc 73560 atggataagn ttggccctgg gggcatgtgt gtgcatgcat gtgtgtgcac atgcatgatg 73620 agccgggcct tgaagggtgg taagatttgg gtgtgtagac caatggagaa aggcatttgg 73680 ggcagtgatg atgggtgggg gagggaacat ggtgatgaat ggagctgggt gtggggagcc 73740 atgggagtgg gttagggcca gcctgtggag gacctgggag ccaggctgag ttctatgcac 73800 ttggcagtca cttctgtaaa gcagcagagg cagttggcct agctaaagcc tttcgccttt 73860 tcttgcaccc tttacagtgt ggctcgcctg ttctcagatg ctcggaggct tcttttatac 73920 agccagaaag acaccagcat gaaggacatg cgcaaagttc tgagaacatt acagcagatc 73980 aagaaatcca gctcaagtaa gtaaaaacct tctctgcatc cgtttataat tggaaattga 74040 cctgcaccag ggaaagagag tagcccaggt gtctggggct tgttcccatt agatcttccc 74100 caaggggttt ttctccttgg tggctggcct gtggggcccc tctccaggag gcattggtga 74160 agaaactagg ggagctggtt gccacagaca gtgatgtact aatcttctct gggaagacag 74220 aagaaaagtc cccagggaag aatactacag acttggcctt agggacagct aggggtgcag 74280 attgctgcca actgcatttt ttctgaagtt ggccatatgg ttgcagtgaa tggatttata 74340 gacagagtat ttctgtgcat ataagagcaa ttacagttgt aagttgatat ggataagtga 74400 aagttaagca cttctttcta aaaagagaat gcaattcatt ttcccctaat catttcaatt 74460 agtctgantg ggcatttgaa cttgttgtct ttaaaaagtg aaatctttac ctctgatctg 74520 gtaagtatcc aggcaatttc ttgtgtgcca cccaggaggt atctggggag tgggcatttt 74580 ctgactgagg cattggctgc catagcatca gagcagcctt ccaggcagtg gcctggcaag 74640 gggacagagg ctggtgggag cagctggctg agtgcagcca gtaatggcat gtgcatggtc 74700 tgtagagaat gtagaagcaa taatgaagcc gataaaagct ggtctgcatt ttattattat 74760 catgcgccgg tggttctaaa caatgtcagt gataaattac tcctccccat catggaccaa 74820 tggctgccac tgctccaggg aagtgctttt tattccgttt ggtgtttagg gagggatgga 74880 gttggctggc ctttgctgaa aggcctacca gtttgttttc tatttggcaa aagaagaaat 74940 gataaagtty tagagtttaa accagactca gatttgagtt tttttttttt tttttttttt 75000 tttttttaag gctacagaac tgtgctttcc ttgggcagta aaagaggcaa tgggcaatgt 75060 gggacctgat ngacaanagg gaannngnnn ncnnannnan nnanngctgt cttaggggtg 75120 gcatggagga ggtgctgctt cacagcagag agaggtatgg ctgtgcttgg agtgtccact 75180 tagacaactc ctggctgtgc agccaggcca tcgagatgct gtttccttga cctgcaggtc 75240 ctggtcttgc acatggatgt ttcttctggt gcaggagaca gaaaggtagc aacaacccct 75300 gatcaaagcc tcagtccttc cttatttact ggagagcccc tgctgattga ccagaggcac 75360 agctggggat atttccttta cctctgtagc aagagacagc gtggtgcaga ggaaagtgct 75420 agcatacatt accctgtggc tgcatgactt tgtgaatagg ttagttagca ccctttcagc 75480 cacttcttct tacctgttaa tgagataaaa catgtaattg cttaaaaaca gtatttggca 75540 cataggaagc acttagtgaa tatgaattat gatttttttt ggagtggtga catctcaacc 75600 aagccattta acccctcagc cttnactttc ctcaactata aaatagcagc taacttgaaa 75660 tgtaaactat aaaacctaat gtagtatctg gcacatagta gattcccaat aaatgagagc 75720 cagtattctt tctaagacag tgatgcattt ctgagcacct ggccttgttc ttctgccttg 75780 caatttatgc agcagttgaa atagactggc tgatgggggt aagttgtcaa gcagactttc 75840 tgatcttagt ggaggagact gccttaaaac aacactaatt tcctttttct tttcttttct 75900 tttcttttaa gacagagcct cgctctgtca ccaggctgga gtgcagtggc gcagtcttgg 75960 ctcactgcag cctctgcctc ctgggttcaa gcgattctcc tgcctcagcc tcctgagtag 76020 ctgggactat aggcatgctc caccatgccc aactaatttt tgtattttta gtagagatga 76080 gatttcacca tgttggccag gatggtctcg atctcctaac ctcgtgatct gcccgcctag 76140 gccttccaaa gtgctgggat tacaggcatg agccaccatg cctggccttc tttgagaagc 76200 tggagacatg agttaagtgg tgaagaagcc aaatctgtat ctaaaaaccc tacagtagtg 76260 tgcagagctc tgaggagaga aggtccctta gattttgagt gtattattat gtcagtgctt 76320 gttttacatc tctctgttca cgcagtatgt ccccttttct gccttgcagc tgtttcttaa 76380 attctttctt tctttgcttg tcttgcagca caaaacaggc ttcagtatag ggggaaatgc 76440 acagaaacac tgccttttcc tacaggaaat cagtaacttt ttactgattt tgtttttatt 76500 tacttatttt atttgtttaa atttattttt agtttttttt ttttttagag acagggtctc 76560 tttctgttac ccaggctaga gtgcagtggt gcccttagag ctcactgagc tcactgcagc 76620 ctcgaattcc tgggctcaag tgatcctcct gccttagcct cccgaagggc tgggatgaca 76680 ggcatgagcc actgcacctg gccaactttt tgctgattgc gaatagcact cttgtcaatt 76740 tcggagagaa gctgagactg gcatatgtca gtatggatcc ccacttagag acctgtgttt 76800 atctgcactg acaccccatc acagcatgat gagcttggcc ctcctgtgct gtctctccca 76860 gggctgggag gatccttgaa gctgatctgg tttggagctt tgtcctcatt cacctccttt 76920 accacacacc caccttccca gggcggggat ctaccactca ctaagtagcc cattctgggt 76980 gttgacagct ctaattgtta gaaaatattc accaccctgt tatgctttct agagaacaag 77040 tctaattctg ttttccttga aatagtcgaa gacagctctc atgtttttct ttccctgttt 77100 tcccaaagtc catgattttt taggcaaaat ggcctccttt cctcttatgg aatgtttttc 77160 ctccccattt ctgcctctcc tctggttgtg tttcagtatg tctgtgtgct tcttgaagtt 77220 tactggaaat tatgaaagta ttctggcaca gaggaggaag ggatttttgc ctcccttgtt 77280 ctgagtgcta catttccgtt aatgcagtct gagattgtat taggcatttt ggcattcacg 77340 tcaccttgtt gactcatatt ccatgtgcac tcaaacaaaa ttgtgattat tttaaatagg 77400 cagaattgca agttacgtgt tctccatttc tttgttgtat tgttggcttt ttgaactaaa 77460 gggaaaaatg tcttttttct gttttacatg tttagattcc ctatgctatc ctatcctccc 77520 aaaaccattt tagattctga ttttgccatg tattatatct gatactcctt tctcgtcatc 77580 tagagatgtg ataaacaact ctctttggcc tcattccagt catcgataac tgtgggacaa 77640 agacttgaag cttggatcag tccagtggag actaaccacc cctgtagacc cttttttcct 77700 caactataaa atagcagcta acttgaaatg taaactataa aacctaatgt agtatctggc 77760 acatagtaga ttcccaataa atgagagcca gtattcttct aagacagtga tgcatttctg 77820 agcacctggc cttgttcttc tgccttgcaa tttatgcagc agttgaaata gactggctga 77880 tgggggtaag ttgtcaagca gactttctga tcttagtgga ggagactgcc ttaaaacacc 77940 actaatttcc tttttctttt cttttctttc tttttttttt ttaagacaga gcctcgctct 78000 gtcnaccagg ctggagtgca gtggcgcaat cttggctcac tgcagcctct gcctcctggg 78060 ttcaagtgat tcttgattct gtagacacta ccactcaggc ctatattgta atcagtgctg 78120 ggccactggg ctcctgcttc tgtgatccag ttgggaagtt tatcttgttc ttcccttcag 78180 cttgcatctg ctaaattcgc tggactatac acaggtgatt tgtagatatg gggatctcta 78240 ctcaaatact ctcatgattt ccttggctag agcatcattt tatttccact tattggaaga 78300 gaccttagag accagttagt tcatttatag ataaattagt tgattctgtc attcaaccct 78360 tatatattga gcgtctccta tatgcgaatc actgttctaa gtgccgagac acagaggtgt 78420 ccaaaacaaa tatggcccct ctccatatgg aatttctatt ctagagaaga atctgaccca 78480 gaagggggaa gtgactgtcc caagtctaca caaccacaga agggatattc tgggaataaa 78540 tcacggctaa acccccctgc tgctccaggc agttctcctt ctacagtgct ctattgtgct 78600 gttttaataa tctttcaact gggaagaact cccatttcag gaattaagcc gtggacaaat 78660 cttttaatta tccttgaaat catcctaata agaaatccaa ggaggaagtc ttacagggtg 78720 cctcacccac ttttctcatc actggaactt tttagacatt ttattatttt cttcctaaac 78780 cagagtacag gcacacaagt tgagtggtgt ggtggctaaa ttaattaatg tttgcaaggc 78840 agtgtgagaa gcattcattc atcttaaata cctatggtga ctgcaactca gatgtaaaaa 78900 ttggataaat cctcagaaac cctagggaaa gtgacatgtc tgtattttgt ctctgtgaga 78960 tacagactgg cagagataag tgtttctctg ggtgagtttt gtgtggtatc tgggatgatt 79020 ttaggcagta cgtggatgag aacttttaat tttaacccac atccaattgc aatttcatgg 79080 aaattattgc ttaggaggat gttcaacagg aaaaatataa ttaaagttaa ttcaaaagaa 79140 acattttctg tgaatatggt aaaacttgtg agagtagttt gtaaatgatt gaagattgga 79200 aaacattggt ataagagtga gtgtggggtt ttgtattaag attcattttg ggaagaaatc 79260 catgctgcat ccctcatgaa gtgtgaactt tgggcatgtg ttgattcttt ctggcccaga 79320 gtttacctga agattagctg ccctgagggt cactgagcat taaattagat gatgtctgtg 79380 gatgactgat agtgaagctc atagcccaca gttgacacat aataaattcg agttgctttg 79440 cttccccttc tgttcctggc tgactgtttg gcctttgcca cttgttcggc ctctctgggc 79500 cttaagtttc tttgccttga aattggaact ttctttggtg aaacaaccag aaaatgcttc 79560 agcccagaaa cttggtcagt acttggatgg gggatcacct gggactaccc aaggatgtgg 79620 gctgtctgct agactatagc cccttgaggg caaggtgggc gccttgctca tggttccatc 79680 ctaagcccca gcacagtagt gggtgcatgg tgagccatta gtgaatcttt gtggaatgaa 79740 ggtgggagaa aataaaatac ctgtacttca cagggtattg tgagggtcaa gtaaaagtgc 79800 tttaaaaaat tgtattatat agtttattcc cttgtgttag cccaggtcaa cagagcctac 79860 gaataataat gatgacagaa gttcttcaaa aagtcttggc cttctttctt tcacaaaatt 79920 gccccccaga gctttctgga agggcagcca tgaacccaga ggcctaaagt agatttactg 79980 ggaagctaaa aatatttact ttatttttca tagctccttt caaggtcctc tctgggggtc 80040 ttagcaatat gtttacacag tggtatgttt ttgtaaggtt tgcaaaagta agatttttta 80100 aaaatactat cttgttttaa aaagagagcc ccctaccaac ttgtgtcagc ctcaggcccc 80160 cacctgcatc tgctcctgcc agggcatggt ggggcaagaa gcactgctcc ccttccaaag 80220 cttccttcct tgccctggag tcatcctcac tccccactcc aagccacctg ccatcgctgt 80280 gccccctctc tggtgaatct ggcattctta ggtgggctga gaagcagact ggcccaagct 80340 aaggcctttc tgatggggtt gttgctgctg agaatcatga ctgggtggga gaaggaggtg 80400 accctttngc tgtcttattt ttactgtgta tttccttttc agctacttaa aatgtattgc 80460 ttagtgatac ctaatgggtt cattagcctg cttcctactg aacatttccg ctcaggcatc 80520 cacttggtcc caaggcctgc tcctctccca tattctgaaa tctggactac agactctcat 80580 tcaactccag gttgcactgt ggacacagtc ccctcttgag caggtacctc cttgcagtgg 80640 ttgggacgtc ctacttggct catagttggg aagtgcatat gctggagctg aagcctcttg 80700 ccttcccggg atagggcgtc ctcacatccc ctctgagaag ttccccagct tccctctgtt 80760 ccccgtttcc acacttagcg aggctcttgt ccactgctac atcccccata gccagtccct 80820 cagccttgcc attgcttatg ctggtctgga aacaattcct agacttgtgg ggcatctggg 80880 gaagttctcc atcttttttt agctggcatg acccaagtgg tgtgggcagg gctgtggatt 80940 ctatggtgtg gctggaagcc aggtagcctc tctctactgt acatggaact cagcaacttc 81000 tgagtcaagc aagatcttag ctctgcaggt gtcttgccct gtccaaagtt atggccacac 81060 cagtaccttt taactctaga agcccagtaa gtgttttgtg ggaccgcaaa gatcattttc 81120 tagacctgct gaatatgcct agaacgggta gggatggctt tcacgctgtt cctagggctg 81180 acaagtcaca cgtttctggg ggtacataca caccgcggtc cctgtgaatg gcactctcca 81240 tgagaactgt gatgatttga gttgaatagt gcacagccta catggttctc tgccatggcc 81300 tggagttcct tatcttgcct tctccagtga ggactagggc tgcaactggc ctactttggc 81360 tcctgacttg ggggattctg aaataccttt tttttttaag gttgtggagc tctctgaagc 81420 ttataaggat tttgccagga aaagataaga aatatctttg ggcattttgt cactgtgctg 81480 gagatgaacc ctttggagga catatcacct tgttgaggtc aaggggcgga aagggacagg 81540 actggcagag agatccgggg cagcagcctg ccatcccgac tgagtatgga gtttctctct 81600 cccttcagct gcacttttgt gtggagtcag tggctcagct gccacttccc ttatgttcat 81660 ggcatgaatc tggcttgtta ggcctttctt tttttttttt ttttttgaga tggagtctca 81720 ctctgttgcc caggatggag tgcatggagt gtagtggtgt gatctcggct cactgcgacc 81780 accgcctcct ggattcaagc gattctcctg cctcagcctc ctgagaagct gggattacag 81840 gcgcatgcca caacaccctg ctaatttttt aatttttatt tatttatttt ttttcaaggc 81900 agaagaattt ttcttagtac agaacaaaat ggaatctcct atgtctactt ctttctacac 81960 agacacagca acaatctgat ttctctatct tttccccaca tttccccctt ttctattcga 82020 caaaactgcc attgtcatca tggcccattc tcaatgagct gctgggtaca cctcccagat 82080 ggggcggcgg ccgggtagag gggctcctca cttcccagaa ggggcggccg ggcagaggcg 82140 ccccccacct cccggacggg gcggctggcc gggcgggggc tgccccccac ctccctcccg 82200 gacggggtgg ctggccgggc gggggctggc ccccacctcc ctcccggacg gggcggctgg 82260 ctgccctgct aatnttttgt attttcagta gagatggggt tttaccatgt tggtcaggct 82320 ggtctcgaac tcnncnntng acctgttgat ccaccntgnn cctcangcnn ctcccaaagt 82380 gttgagatnt acaggcatga gccactgcgc ccggctctgt tntttttgtt tgtttgtttg 82440 tttttgtttt tgttttgaga cggagtctta ctctgttgcc caggctggag tgagtggagc 82500 ggcacgatct tggttcactg caacctctgc ctcccgnggt tcnaaacaat tcttcctgcn 82560 ctncagcctc ccgaatagct gggattacan ggcacttacc accnaggcct ggctaatttt 82620 tgtatttttt agtagnnang acggggtttt gcncatgttg gccatttgaa ctcctgacca 82680 tctttngaac tcctgaccat gtttgcncat gttgaactct tgacttcagn gtgcgttggc 82740 cntcccaaag tgctgggatt acaggtgtga gccaccatgc ccagcctgtt tggcctttct 82800 gatatgngnc ntcntgacta atctttttgg aaattnagnn ntccccaggg ttatactgga 82860 ttttacttag ggaaaagggt catgcnctct ctggctgtca gnatttactg atagtactaa 82920 ggnactcagn tggggtgnnn gnacctttga ttcnntnggt ttgatttttg aaaatncaaa 82980 aagacgtgag ctccagggag cagggtggct ttggtganca tggcaagata gttggctgtg 83040 gcnagggagt tgagggaagt gggtagaaaa ttaacatctt gtanaatatt nncncctggg 83100 aaatatacct tcgtgttaag agaacagact tggcagnccn nannngnnan nntnnnggcc 83160 taggttcaaa tcttggcttg atgcttatca gctgtgtaac cttggataat tccatacatc 83220 tctgtgcctc agtttcctca aatggaataa caatagtacc tccctcagga ctattgtggc 83280 aaattaatgg acgaataagg ggaagcactt agtacagtgc ctggcccagc ataggtacca 83340 ggcttgttct taagctcact gcatttttac aatcatcata aaatgcaggg gatacacaca 83400 tgaaggagcc gaagttcaga gaggccaagt aacttgccta agaaagcaca gctggcaagg 83460 ggcagtaata ggaccagaat tccagtttct cgtgctctgt tgttgttata tcctaaagag 83520 agcagctctg agtagccaga agcttcccta aagtcacagg acatggggca tgggctggct 83580 gggatgagaa aggagacaag agggcttctg aaagaaatgc cagattcact ccacttcctg 83640 gcttcaggca ccgatggaat gtttcccaag gcccatctag aaagaacatc ctgtgactca 83700 cagccacttc tcattttctg tctgaacccc ctcacccatt caggcagctg ctaaagttga 83760 ggttacagcc tcagactata ttttctgtcc ttgtggaacc ccagtgtgtc atcttgttgg 83820 gagatctggt gatacatgtg tcaacattat gtcatcaaaa tggaaattct ttgaaatctt 83880 taggtgattg caattcacgt tctgtatgta tgcacttgtc aaaagttttg atttgaggcc 83940 ttagaatttt atatttggaa acctttccac taccatgagt tttcccagac ctgtcaaagc 84000 caggctgcat ctcagaaacc aggtctttga tttccatcca gggcaagggc ctgggcccag 84060 ctgggctgta agcaggtggg ggtggggagc aacgctgcac tgcaatgttg aaatattact 84120 tgaactaaat caaatcaaag atcagcttta ctcagacaag aatagaaaac acaattgcat 84180 tcgattacag aatagtgtgt atccccacaa atatcagact gcctttaaaa agttttgaat 84240 tgttaacatc aagaacagtg ttgcgtgtct cctgcttttc cagcataagg tttatttatt 84300 ctgtgggtgg caaagagcaa tttgggagtc cagtttgttt ctcattgaaa gctttcccat 84360 ttctggtcct cttgtcactg ttgcattgag gcaccaaaag gcaatctcag tgcgacacta 84420 ttcaacagac taagttgcac cggataatga taccatttta catttttcat atattattac 84480 attgaaggct tcaaacagca ctagcagggg caaattggta ttattatctc catttcattg 84540 atgaggaaac cgaggctcga aagggtaatg tctgttgtcc aagattaaag agtaaagttg 84600 tacgttgaat cggggtctga ctcctaggct tagcattttc tccccacact atgctgccat 84660 gttgcttatt ccaacattag gaagcatagg tgccatcccc agcttttgag gccaatatca 84720 cgatgaagca tttttaaaac atctcattaa attgctgata tagtggaaag aaccaaagct 84780 ttgcagtcaa agctgtttgg gttcaaattt accacttgtt tgttctatga ccctgagcaa 84840 gatattctcc agatctgttt cctcatttga aaaatgggaa taataatacg tttctttata 84900 ggctgttctt aaagattctg gaaaataatg ctaatagtgt gcctaatgct tggtaaatat 84960 gagtcacttt tctgtgccca caaagcacta ctatgtccct taataaattt tgttaatttt 85020 aaaagttaga aaaaaattaa actatttata cattgtgtat gttaattctt ccctagacca 85080 gccttaggaa gaatctcatc cccaacttgt aaactcatct ttttccgttc tttttgtgcc 85140 tggacttctc agggccctgc aggctgattc tagtcccatg ttgtgtggtg tttgaagtgt 85200 ctggtccctt ttttcagtga gagaccagct catccttggg aactgaatgc ctcaaactct 85260 cttttctttt tctctcttcc ccttctgttt gatgtagtct ttcctgtttc tggactctgt 85320 ttcttcatac ttcctatctc ttaccttctt ttcactcctt ttgtcttcca gctgtcctct 85380 ctcatttttc tgcctctctg gtcttcaggt agagttttca tctcagctat cttcttgctt 85440 tttctgatgt tggttctttg tttcttcttc tcattctgtt caggtccaaa attcatttgg 85500 gtcaatgtta tgtcttagtg ttatcttcca tttccttctg agcttcaaag ccaggctgac 85560 tgtgcccttc cacgccctgg ccagtgtgac caggacatcc tttccctctg gggctgcact 85620 ggctctttgg gggaattgtt accattcagg gtcttcaacc ctcattctag ggacttccag 85680 taacttctcc caccttccct cttctcagta agacatgggt attgtcttat tctgtttctg 85740 ctgctagaag aaaattctcg agactgagta atttatacac aatagaaatt tacttctcac 85800 agttctggag gctgggaaat ccaaaatcaa ggtgttggca ggtttggtgt ctggtgaggg 85860 ctgctctctg cttctaagat ggtgccttgt tgctgcatcc ttaggagggg acaaacgcca 85920 tttcctcaac tggcagaagg gactgaagcc tctccctcaa gcccttttat aggggtcctc 85980 attgtcaggc ctctaagccc aagccaagcc atcgcatccc ctgtgacttg cacatatacg 86040 cccagatggc ctgaagtaac tgaagaatca caaaagaagt gaaaaggccc tgcctcgcct 86100 taactgatga cgttccacca ttgtgatttg ttcctgcccc accttaactg agtgattaac 86160 cctgtgaatt tccttctcct ggctcagaag ctcccccact gagcaccttg tgaccccctg 86220 cccctgccca ccagagaaca accccctttg actgtaattt tccattacct tcccaaatcc 86280 tataaaacgg ccccacccct atctcccttt gctgactctc ttttcggact cagcccacct 86340 gcagccaggt gaaaaaaaca gctttattgc tcacacaaag cctgtttggt ggtctcttca 86400 cacagacgca catgaaactc atggtatctg tgaggggccc taatgctatt ggtgagggca 86460 ctgctcacat aacttaatca ccccttaaag gccctgcctc ttaatactgt cacattggtg 86520 atgaagtttc aatgtatgaa ctttgagggg ggacacgttc aaatcatagc aggtgtgcta 86580 ttgccttact agcaaagtaa tctgggggaa gatgagtaat gtctcgttcc acccctgatt 86640 ccagtgctga ctgattctgg agtggcagag agggttggag gctcaccgct ctgctcaccc 86700 gaccctctgg ccatctcctc ttagaatgca agggcaggga ttttgttaca cagcgcctct 86760 tttgttatag gctagctcct gccttcaagg agcactgaga aaaatattat ccctttgaac 86820 cacaacacta gtattttggg tactgtagcc attcagaagt ttgttgaact aatacaagtt 86880 tatattattt gtaaaatact agaaggaatg tgtggttcct aaagttatgg tattcccatt 86940 ggttgaaaga gaaganctct ggcttcctaa tgccttggaa ggttcaggga gcataagcct 87000 aaatatcctg gtttctcttg ggaattcttc acagcttgtc atctatgact ttatccacat 87060 ttattttgaa catgtttaat tttcagcttg tattttctca aggagtatgt gtgagtcagc 87120 attcagagta cttaacacac ttagtgtcat tagtgataac ttaacccaca cgtataggaa 87180 gctagacctc catttacaga aaaatggagg ctcagagagc ctaactgact ttcccaaggc 87240 cacataactg gaaaattctg gaactgagtt ttttcaggta tgtcttggta tccctcggtg 87300 agccttctct gtagagaagg aagtgtctcc tgggttggaa agcccaggcc gggatgaaca 87360 gcatgggtgt ccttaggctg tgtgtaccac acacccctgg cctactctcc cctgctctaa 87420 ggagtcacct gaaggagcag ctgcatcacc ccgcctcacc ccttctcctc agcacccaac 87480 aaccaggtct tcccagagag tgctcagcta ttggtgaagc aggcatcgtc tattcttcaa 87540 gaagacagca gagcatcagg aaaacaaaac accaaaaaaa taccataaag acactggagt 87600 gtatttgtta ccagcccttg tataacagag aagtatttca aggtcttatt tgtaaggctg 87660 gacaagctca ggctgattaa aaccacgagt tagaatttgt ctcccgctcc caggccctgc 87720 tcagttgaca gctcctagca ggctgggagg ccccacccca cccccacccc aggggtccat 87780 tgaagaagac catctgggcc tgggatgctg atgtgatttt ccctacttct cctttttttt 87840 catttgcctc ctccacccac ttctaaagca gctgattgct ctggttcctt ttcatgttga 87900 gcaaaagaga agacagatct tgaaacctga gctggggact ggagggcact gaccgagagt 87960 aaccccgtct ggctcagctc agctcagcaa agggctttcc tcctccgcct tcctgagctg 88020 ctctccctcc tctcctttcc tccttctttc ccagcctccc ccaagctcct tcacccctct 88080 cttccctctc ctccctcctc ccaccttcct tgtcctctat gctcagagaa ctgaaggaac 88140 actgaatggc ctggaaagca ggacctcgcc cccaccccgc gtcatacccc tccgcctgca 88200 cgtgcaaatg tgttcctcag cttggtccca gaggcctgga cctgggtccc agaggtccca 88260 gctggttccc cggcttggtc ccagaggcct ggatgtgtgg caagaagctg agggttctgc 88320 ttcgtttctt ccagcccaca aacacacatg tcacctgttt tttgttcttt ttcctttcct 88380 ttgccccact ttacagaagc tcttggctaa ttggtaggtc actgtttgct cctccaggca 88440 aaaccaagga ggctgagctc agcaggtcaa gtattttggt ctgtattgaa gagactgtgg 88500 tgggagagcc ctcttgacct tctgctggat agtgactttc tcttctgctg gcagcagtag 88560 cccagctccg catcctgtga cagaggtcat accttccagt tacccgagcg aacacttgga 88620 gcaagtcagc tttttcccct gaaacagcac gttcacgttc ttgatctttg gtttgcacaa 88680 caacctgtga atgttattag ccccgttttt tttgtttgtt tgttttttga gacagagtgt 88740 cgctcttgtt gcccaggctg gattgcaatg gcgtgatctc ggctcactgc aacctctgcc 88800 tccaagcaat tctcctgcct cagcttcctg agtagctggg attacaggca tgcgccacca 88860 cacccggcta attttgtatt tttagtagag acggggtttc ttcatgttgg tcaggctggt 88920 ctcgaacttc cgacctcagt tgatccaccc gtcttggcct cccaaagtgc tgtattagcc 88980 ccattttata tccaaggaaa ctgaggctta gggagattta aatagggggg aaagtatagc 89040 ttcatgatta cacagtcagg aaatggcaga gctgactctt gaattcagac cctgtgatgc 89100 tgtatccagt gctgattttg ttttgccctg tgttcctttc agtgtcaggt aagaagacct 89160 cagagatcat cttctccaac ccacacattt tactgattat aaaatccagg tccgcattaa 89220 ctcattggcc aaagttaagt ggctaaatga ctctcctgtt caggaagtat tccctgtctt 89280 gtgtggaatg attggtcttt cgtccaggtc atcttccacc tatatccatc cttgcagtga 89340 ctggtggata tgatcattcc tggtgattcc ctgtaaaata ctagatttat ttattctttc 89400 ataaagttat gagtagacaa atcagtcagc ttcaaccttt cctttacctt atagatagta 89460 tcagcagaaa gaactcaata attctgtctt aggaactcag ccaggctaat atgtattata 89520 atacacacat acatatgcac acacatacac acatgtgcat accttagatc acaaattcaa 89580 catctcaact tagccgccaa agccacnttc tgtgtcttgt ctattctgaa tatgctttaa 89640 ggttaaatta gccatgaatc tgttcagatc agtcattcct gaaaatgatg atgatattag 89700 ccatccttta ccgggcacct tactcagtgc ccggcattat actaggtctg caggtacatt 89760 tgctcattta acacctagct cagtccaatg aggtggatgc cattgtcatc tccattttat 89820 catttaatca actcagcctt cctctctcct gttccccact ctgcttctag ctgccactca 89880 cctgactctc ctcttgatct gggctgacgt ggctgagtgg gtttagcaga actcttgtgt 89940 cctctttact tggtttgttt caagttcatg tcctgcttca tcttttccct aactcactgc 90000 aaggtgaatt ttcttctctg caatcggcaa ttccccaaat cacatcaggt cattaattta 90060 ttcactcatt tgtattcatt cattcattag ttcatccatc catctattca ttcattcatc 90120 caatttactg agtgcctcct ctgtgccagg cactgctggc tttggaatta aaaaccctgg 90180 ccctcaagga atttctattc ttctgggagc aggatattta catgggtgac tgcnagcttg 90240 atctgtgtgg tactagtgga aaacatagga tgttgggcac aaaagagtca caaaacgccg 90300 ggcgcggtgg ctcatgcctc taatcctagc actttgggag gccaaggcga gtggatcacc 90360 tgaggtcagg agttcaagac cagcctggcc aacatgatga aaccccaact ctactaaaaa 90420 tacaaaaagg aataatagct gggcgtgatg gcgggtgcct gtaatcccaa caactcggga 90480 ggctgaggca ggagaattgc ttgaacctgg ggctttggag gttgcaatga gccaagagcg 90540 ggccacttca ctccagcctg ggtgaaaggg tgaaactctg tattaaaaaa aaaaaaagtc 90600 acaaaaaagg gctggccacc taacccagct tgaggatggg aaggccaggg caggcttcct 90660 ggaggaggtg actcatgggc catgtcatga ggatggggta agaggagaag gtatgtttca 90720 gacatctggc tttcttaata ttcttggctt tccacccatt ttcctgccag caacataggg 90780 gagaagactg agaccagcag atacaaagcc actgtactct gtcctcaccc cctccttttt 90840 tctccccttc ctactaccca gatctgaggt tttgagaaat ctcttctcta attatctgct 90900 tgttcattca tttgttgaga atctactatg tgcatagcat tatgaccaca aaatgtcctg 90960 gtcgttgtca tcatgagagg tcttacccct tcctctccag ccacttctag ggatttttga 91020 ctctgtcctt ttccagaact tggctccagt ctggttgctc gccatgaagc acttacagat 91080 aaacctcatc ttgggccagt gcttccattt actgtctcct tttggcttgc ttatccttcc 91140 ttctgccttc ttgaattgat ttgcctcttt atctcccctt tcagccttga aagttcctcg 91200 aaggcaggga ctgtgtcccc atcttttcta taaatggcat tgttgtacat tggtaaagtt 91260 gagtgaaaaa atctttttga taatcacaaa ttattaccat atattgagca cctaccatga 91320 tgagtatctc tgatcttcat atctttgaaa tgtggatact attaccttca cttaattggt 91380 aagaaaattt gtgctcaggg aggtaaagtc actttctcac agtcacacag ctatttcaca 91440 ggagagtgca gtatcaaatt taggtttttc tggtcccaaa ccctgatgtt tcccccacat 91500 cattgtttct cagacttggc tgcaatcacc tgaaaaattt tattaaattc tttttttttt 91560 ttttttttga gacagagtct cgctctgtca cccaggctgg agtgcaatgg tgccatctct 91620 gctcagtgca accactgcct cccacgttca agtgatcctc ctgcttcagc ctcccaagta 91680 gctgggatta caggcaccca ccaccacact cagctaattt tgatatgttt agtagagact 91740 gggtttcacc aggctgatct caaactcctg acctcaagtg atctgcccac ctcacccagc 91800 caggttttat taaattctga tgcctgggtc ttaccctagc agttctcatt agttgctcca 91860 tggtgtggcc tgggctttgg gatttttgtg aagcccccag ctgagtctaa cgggcagtca 91920 tgtttgagaa ccagtatact acaccatgtt gccttcctgt tttcacacag gttggtgcta 91980 gtgtgtgagt ctaagcctac atgggtggat atccacgtga tcaggctgca agttctttgt 92040 agtggaggat ctttggccct ccctgctgct tcccaaccca gcgaacctac cacaccatgc 92100 acgtgcacag ccaagggttg ttgactgttt aatcagtctc tgggttctta tgtctcatcc 92160 gggggattca ctgataggag gctggagctt gatcacatcc agatgttctc atcttcagag 92220 cctgttaaca taacagaagt cttataaaca tgctgagcac actcactggt ctggagagtg 92280 ttgcaggatg acctaggccc attgcagggg tggcctgggt cgccctccca cacttctgcc 92340 accctgcttg ggaaggaggg agctgttttg aagttccctg gtgcaatatg atgtattgct 92400 ctggctctgc tcaggaggaa gctatggccc acctagtcag tgagggttag ctaatacgtg 92460 tattctgttt ctgttgaggt ttcacagagg ccttttttga cccttcattt tatagataag 92520 gcagtggagg cacaactaca tgaaatgact cgacaaataa atggatttag aacccaggtt 92580 tctgactccc agggtggtgc tttttccatg gttgtacagt gattaatgtc taccttttca 92640 caccagtcct caactgaaga caccagctta caccttcctt cctgtttctc ccaagaacag 92700 aaagtgaccc cggatgtcgc tttctgttcc tgggaaggca gttccagtgg ttagaagtcc 92760 tgttcactcc tggggtgtgg cctggggatg gtcctgacat ccctgggctc ttcctggacc 92820 tggccagcta aaaggaaatc tcctatgatg gtactcagat acttttggaa ccttgtcagc 92880 cctaatccca tctcctagtg tttagtattc agctaccctt cactgggcag taattctgtg 92940 ccaggcctga tctgggcatt ggtggtacta agaacgcata ttccctatcc tataggcata 93000 gtctggtaga gacaacatgc aagtaaaaca atgttcctta aactgtggtt ccacacctcc 93060 ctcccccaac attaaaagtg taagggatgc ttattcaaat gtagatttgt aggctctgca 93120 ctctagaccc actatttcag aatctctggg gactgggccc aagaaactgc attttcgcat 93180 gctccctaaa tgaagcttag gtgctctgag gtttgacaac tgcagtagag agcctaatgc 93240 taacagtgta gagtcacatg tgatgggaag catcaggtag gtagcagttt gcaggagcac 93300 tgattctgag ggacactaac tgggcctaag aacagctact ggctgtcatg aggaataact 93360 aggagctagc catagagggg tagcagtgaa tcatttctct agcgatgtaa atcttgctca 93420 atttattctg tctatataac tcaatattac tgaagtttgc ctaaagcaga atacacctgg 93480 atcatacagc atttatgaga gactggctgg gctgtcaggc cctcctgtta ctttatctct 93540 gcatgtgacc ctcttasstc cgcggattaa ctcctgtcct cattaagcct cacactgtag 93600 ccccattttc agatcaaacc tgtttctctt ctggtaaatg atttcagttt gcaaagtttg 93660 ccctctagag gttgcttagt gcctggccat gtgggctcag ttcatgtggt cctgatgagc 93720 tggttttatc tttattacaa agaagttagg ctgttaggag agtgggttgg aaggagaaga 93780 ggtagacagc caaatgagat gagtcaggga aaactatact gtttcggagt catagggctc 93840 ctaccaagca tctggtcaga aacctctcat tttggagatc aagaaattga ggttcagaaa 93900 gatgacatga ggtacgcagg gacgccacca gacacagcct ccaactctag aaactaaaat 93960 tctggattct tagtgctgct ttttctgttt tgttgcactg gattgaagcc ttttctaact 94020 gtactcagag ggcctattat ttagggagat tccgtatgaa atcctttagc aatcaaatca 94080 tttaataggg ctgatggttt aaatatatgt taatgtgttt tcctaaagcc tggcagacct 94140 gggttttgga gctttgtatc acatgtttta tgtttggaat gaaaatgaga ccatgtctgt 94200 gaaggcactt tgatatgcgt aatgcactct gccagtgttt gtcaaaacat ggtccccagg 94260 tcagcagcat cagcatcacc tgtaagtgat tctccagtcg catcccggcc caggmsmggg 94320 ccmrrctcny aggcyytgtg smmggcsmgg nnsggnnntg nccggcccgg cccggcccgg 94380 cccggcccgg cagtctgcat tttaacaagc tctccaggtg attctgatgc atacttaagt 94440 ttgagaacca ttgcttgttt tgcattaaac aggagattag tctctgcagc ttgtgggaat 94500 aaagctttaa atctctccaa ttttagctct gtgaaaaggc agtggggaga caggaatgaa 94560 cggactagtg ccacaaagct caggtggggt gggtgagatc atttagaaga gaaagaccgg 94620 gcatggtggc tcacgcctgt actgtcagca ctttgggagg ccaaggcagg ttggatcaca 94680 aggtcaggag tttgagacca gcctgcctat catggtgaaa ccctgtctgt actaaagata 94740 aaaaaaaaaa aatttgccag tcatggtgat gcatacctgt aatcccagct actcgggagg 94800 ctgaggcagg agaatctctt gaacccggga ggcgggggtt gcagtgagct gagattccac 94860 cattgcactc caacctaggt gacagggtga gactccgtct caaaataaaa aaaaaaaaag 94920 aaaaggaaag gctgtgtgtg tgtgtatgtg tgtgtgtgtg tgtgtgtgtg tgtgtaacag 94980 caccatcaca ctgtttgagt tgaggagcac atgctgagtg tggctcaaca tgttaccaga 95040 aagcaatatt ttcatgcctc tcctgatatg gcgatgctcc cctatctcat tcctgtgtgt 95100 gtttagccag gcaactgttg atcatcaata ttatgataac gtttctccac tgtcccattg 95160 tgcccacttt tttttttttt ttgagttact tactaaataa aaataaaaca ctatttctca 95220 atagacttga agcttcaaga tttcctggtg gacaatgaaa ccttctctgg gttcctgtat 95280 cacaacctct ctctcccaaa gtctactgtg gacaagatgc tgagggctga tgtcattctc 95340 cacaaggtaa gctgatgcct ccagcttcct cagtagggct gatggcaatt acgttgtgca 95400 gctactggaa agaaatgaat aaacccttgt ccttgtaatg gtggtgaagg ggagggaggt 95460 agtttgaata caacttcact taattttact tccctattca ggcaggaatt gccaaaccat 95520 ccaggagtgg aatatgcaac ctggcgtcat gggccagctg gttaaaataa aattgatttc 95580 tggcttatca cttggcattt gtgatgattt cctcctacaa gggatacatt ttaagttgag 95640 ttaaacttaa aaaatattca cagttctgag gcaataaccg tggttaaggg ttattgatct 95700 ggaggagctc tgtctaaaaa attgaggaca ggagacttta gacaagggtg tatttggaga 95760 cttttaagaa ttttataaaa taagggctgg acgcagtggc actgagttga gaactgttgc 95820 ttgctttgca ttaaatagga gatcagtccc tgcagcttgt gggaataagg ctttaaatct 95880 ctccaatttt agctctgtga gatggcactg gggaaacaga aatgaacgga ctagtgtcac 95940 aaagctcagg tgggatggac gagatcactt caaaggtctg taatcccacg tctataatcc 96000 cagcactttg ggaggccaag gcgggaaaat cacttgaggt caggagttcg agaccatcct 96060 ggccaacatg gcaaagcctg tctctactaa aaatatgaaa attagctcag cgtggtggca 96120 tgctcctgta gtcccagcta ctcgtgaggc tgagacagga gaatcgtttg aacctgggag 96180 gcggaggttg cagtgagcca atatcacgcc attgcactcc agcctggctg acagagtgag 96240 actccatctc aaaaaaaaaa aaaaaaaaag aattttataa aatcaggaaa taatattagt 96300 gtttatgttg aattttaact ttagaatcat agaaaacttc ctctggcatc attattagac 96360 agctcttgtg cagtgggtag caccagaccc agcttgcatg gttattgatt tttcagagac 96420 actttttgag cttattctct ggcagaaagg ggaactgctt cctcccctat ctcgtgtctg 96480 catactagct tgtctttaca agaagcagaa gtagtggaaa tgtttattct tgaaaataag 96540 ctttttgctt cacatgatct agaattttta aaattagaaa aatgtgctta ctgcgtgccc 96600 ttctgaaact aggaaaatat gccttgtgtt taccaattgt gtggttagga gatgggccaa 96660 aggcatcagg cttttgaaag tagttgcatt tagcataatt tccattgccc cctgccaatt 96720 tcatatctgt cacatctaat cagtttaaaa taaggggcat cctaagcatg gagatggtcc 96780 ttggatggtc cttggagttt ctgtattttc agtattcttt tttttgagca tacaagacat 96840 ttattgaaaa attcttggga tcaatacttg tgtaaggaaa ggaaagggaa caaagcatga 96900 ttgggcagag gcagaagatg acatcaacaa aggccctccg ttgtcagtat tctttttttt 96960 tttttttttt tgagttggag tctcgctctg tcacccaggc tggagtgcag tggcgtgatc 97020 tcagctcact acaacctctg cctcccaggt tcaagtgatt ctctgcttca gcctcctgag 97080 tagctgggat tacaggcatg caccaccaca cctggctaat tgttgtattt ttagtagaga 97140 cggggtttca ccatgttggt tggccagact ggtcttgaac tcctgacctc aggtgatccg 97200 tctgccttgg cctcccaaag tgttggaatt acagacatga cccactgcac ccggcctgtt 97260 gtcagtattc ttaaacatag acactaactg taggctgaca gcctagcagc aaggaccagt 97320 taaagaaatg agtagaactg aagtgtgctt gagtatctct ggcagtcagc aaaaacttaa 97380 tgggaatcat ggtaggccaa atgttctgca gtatttcaaa agctgcatgg gttttgagag 97440 gcttttggtc catcactcac cttaggttgt tctacaagca catcagcctg ccccaattta 97500 aacagggcag ttagtaatgg tgtaataaag agtctgcatt gttcattcct tcaacaaata 97560 ctggctataa tgtttcagca ctgtggatgc aaagtgagca ggataaacag gctcttcttt 97620 caaagcttgt ggtccactgg accacgtatg aagtagaata gtttaggtcc agaaaggcaa 97680 ttaagtaaaa tatgaccaag aagaggctct ctagtgggtt tggtataaag aaaagataag 97740 aatgatttag aattggccta tcaatgagat aagaggcctg gctttctggc actctgctct 97800 agggcaagta aaatggagaa ttccaaattc tgaaattgtt agaacatagt tctgtgtctt 97860 agttaaatat ctacacttac agataaatag cataaatgct ttctccccat atttcagccc 97920 agtcctactt aaagacaaca taaattgcaa aatagtgagg atgttgttca tctaataaaa 97980 gtggttccag gaattcagac tctggattcc tgtttgccaa atcatgtgtc ccactcttaa 98040 gaaaacgagt tggactctgg atttttcttt gcaagaggga caagagtgtg ggagatactg 98100 agttaatgca acttgcaggt tttaagtgtc ctgtcattgt gccttgtgct ttgatacatt 98160 ctgagtttca gtaaagagac ctgatgcatt ggactgttgc aatggaacct gttttaagat 98220 cttcaaagct gtattgatat gaagttctcc aaaagacttc aaggacccag cttccaatct 98280 tcataatcct cttgtgcttg tctctctttg catgaaatgc ttccaggtat ttttgcaagg 98340 ctaccagtta catttgacaa gtctgtgcaa tggatcaaaa tcagaagaga tgattcaact 98400 tggtgaccaa gaagtttctg agctttgtgg cctaccaagg gagaaactgg ctgcagcaga 98460 gcgagtactt cgttccaaca tggacatcct gaagccaatc ctggtgagta gacttgctca 98520 ctggagaaac ttcaagcact aatgctttcg gaatgtgagg cttttccttg gacagcatga 98580 ctttgttttg tagaaaagta cggctggctg ggagtttgtg atataattta gttcagtggt 98640 attctaagtg ttcttagtgt tctttcagac ttttgggcca tctcccaaag ggtgaatggg 98700 aagaataagc tgggtgtggc tgagtttaag ccaaaagttt tttgtgcttg tttcaatcag 98760 agaagacctg ctttttcatg tttttactat tataatacta agcaagagct catttgaaaa 98820 cagagttctt catatttaaa aaaaaaaagt cttgaaacca ttgatgggaa gatggatatc 98880 tatttatgtt taaaaaccca tcataaagat gacattgtgg gctgtcacag ttggaaggcc 98940 ctggaattag atgagaccac actatttagc ttacttagta ataacattgc aaagaaaaat 99000 tccgacgaag ttttttcagc ctaggaatca atagttcaga gaagcactct atgagaatac 99060 ccattcattc ttaaccaaaa aatactggtg agcctgagca gtttggtcat cagagtgttt 99120 tatatagttc cagaacaaat atgtctctag gtgttctgag agctctggtg aaattcctct 99180 cgctacccca aacatcatca tttaatatcc aggattctgg ttttctactc accagataga 99240 ttctcttaaa accagggaaa gattcctgga ggaaggatgt atctggaaag agatgttcct 99300 tattataata aaatgaaatt gtaatactct tggattttgt gcagcacgaa ttctttatag 99360 agagttggtc ctcccagaga attaagaata ctcagtttct ggaccctgtt cccagatcat 99420 accctagaat gtgaccttag aaacacactt caggattcat acctttgatt gaccatcaaa 99480 aagtttttgt atcggccagg tgtggtggct cacgcctgta atcccaccac tttcggatgc 99540 cacggcgggc agatcacgtg aggtcagcag tttgagacca gcctggccaa catggtgaaa 99600 ccctgtctct acaaaaatac aaaaaatagc cgggcgtgat ggtgggcacc tgtaattcca 99660 gctactcggg aggctgaggc aggaggatcg cttgagccta ggaggtggag gctgcagtga 99720 gctgagatct gtctcactct gttgcccagg ctggagtgca gtggcgcgat ctncgactca 99780 ctgcaacctc cgtctctcat gcttaagtga ttctcatgcc tcagcctccg agtacctggg 99840 actacaggca cctgbcacca cgcccagcta ttttttgtat tttntagtag acatagggtt 99900 tcactatgtt gcccagctgg tctcgaactc ctgagctcaa gtgatctgcc cacctcgggc 99960 ctcccaaagt gttgggatta aaggcatgag tcaccgtgcc tggtcccatg ttataatttt 100020 aaagtaaggt atatttctct acagggatct ttgcaaccct aagtaanctg gcctaaaaag 100080 ttagagaagc tgacttgtgc agacatttgc agcctgttgg tcttttttgt gctgtgaatc 100140 atagagggtg aaaggttatt atgaatggta caaaactttg ttacaaaacc attttcttgg 100200 actgttttgg gctgcttcac tgcatgacaa atgctcaccc tttcagctgg aatgattgaa 100260 attttggaaa agatgggtgt ttttagaaga cattgtaatt tgttccggtg ctgtgcccat 100320 tcattccatt tcacttctgt ttactcatta aacacctatt gtgtacacaa cccggtaaaa 100380 tccctccact cacacaatgc ctgaattata ctcatagtag aatgactgtt tagccctcat 100440 catctgataa ttaacagctc aggtttcaac ctgacagtat ctctctggga ggattagcag 100500 cgtgacagag tgcagggaaa tgcaccttca gaaccgtcag ctacactgtg tcccatcctg 100560 ctgtgttgtg gttgtgcctt gtggatgcgt tggtttatga ccaggtattg attaaggtgg 100620 ctactaccag gtgctttctg catatctcgg gtttgtggag cactcaggtt ctgcttctgc 100680 ccctctgctg ttaccaagag acctctcttc aaaatggggc tcttgagtta gagtagaatg 100740 agtgatcagg attgttttgt gtaaagatga tttctgagga aggctttagg atgaaatgac 100800 ttccaaacat tttgaaatgt gactcttact tattgaatta agcagggcct taattggaat 100860 gctgggactg atacttgatt tgcattaagc agcctttttc tattgctgct tggnttgaaa 100920 tttcaacatt tgtgatggta gatggatgtg acatgtgatg acattgcaca tgggcagtta 100980 actgtgccaa gaagtgcagc agtagcagca accggagatg caaagcccaa catgatgggg 101040 agagaaactc ttctttcaat atgtgcttct gtaccaaaag tggaatttca cgagagacat 101100 attttgaaac atttctcctt ttgtgtgtgc gtgagtgttt ccctgtttcc agccaagggt 101160 attgtgagtt tctcctgggc ctccttcaga atctgggtgc tctggaaagc agtgttttgg 101220 caacatgggg aaagtatggc agtgtgggag ggtcagctgg gtctgggttt gaatattgca 101280 tttgaatatt ttaccagcat tgatgtcgga taaattattt agtccctgta agcctcagtt 101340 ttctcttctt ctacatacac ataatatatt tgactctttg ttgtgattat tggttacaca 101400 tatgaagagc ctggtgtggg gcctggcaca caataggtgc tcaataaata gaagttgata 101460 atttaattga catgagtagt agaaattatg tccttgaaaa caattgcgtc aagatagaag 101520 ttttcagcca ggcacagtgg ctcacatctg ttgtaatccc agcatattgt gggggccgag 101580 gcggatgaat cacttgaggc caggagttca agaccagcct ggccaacgtg gtgaaatccc 101640 ctctctacta aaaatacaca tatttgccag gcaggcgtgg tggcgcacac ctgtaatccc 101700 agctactgaa gaggctgagg cacaagaatc gcttgaaccc aggaggtgga ggttgcagtg 101760 agctgagatc actccactgc attccagcca gcgtgacaga gtgagactct gtctcagaaa 101820 aagaaaaaaa agatagaagt tttcttctgt agatcagtgt tagaactcat accaagcgaa 101880 gtggtcctgg tgagtatttc agtgaaaaac tgcattcttg ctcagatatt gtcaagactt 101940 ttcacccaaa gattcttatt tatgtctcag tccgtacctt gtgtgaaaat taatactgga 102000 tgtcagaacg ctgttgtgtt tttaaagttc cctggggtta agagcagttt ccattaggtg 102060 ttctctgctt tttacttaaa aatcttactc atgcattgag caatatttat tcagttctta 102120 ttatgtgtca ggtattttct aggagctgga ctcaactcaa aagatatcct tttgatgaga 102180 acaaaggtgg gtggatatat gaaatattat ctgtgggata aatgcactta gtcatgaggg 102240 agacttgtta tggagtgcgc tcattgtatt tgtactgttg agttaacaac ttctaggagg 102300 agctcagggc cacctggcag gggcttcttt tgtcttgctg ctcagcaagg tgtattttgc 102360 tgtagagtgt gctgggcagg tgaacttttc ttaactttct cttgggtcct tcctaaagca 102420 gcatgtacct ttcccagagc gaggagaggg ccaccttcct gtctcacaga aacctccaat 102480 ctgttttgga ctgcaggaag gagccatagt agtggaccag caaattttgg cccgagagat 102540 tgatttgctt ccgattgtac tttttttttt tattgctact ataacaagtc accaaatact 102600 tagcagttta taaccacata catttattat ttcataaggc cagaagccag gatacagtag 102660 agctcaccta ggtctacttc ttagaggctc acagggctga tatcaaggtg tttgtggggc 102720 tgtgtttttt tgggggggct ctgaagatga atctgctttg gagctcatcc aggatatcag 102780 atgaattcag tcccgtgcag ttgtaggact gaagtccatt tccttgctgg ctgtcagcag 102840 gggctggttt ttgctcctaa aggttgccat cattccttct tatgcttttc atgtgactcc 102900 tctcccacaa gtgggttgag tatctctcac actttaaacc tctgacgcct cctgtcacat 102960 ctgtctccag tcagagaaag ttctttactt ttaagggctc aggtgtttag tttgggccca 103020 ttcacataat ccaggataat ctccttattt tgaggttcat aactttaatt acatctacac 103080 agtgcatttg ctatgaaatg tgcatattca cagactccag gaatagggtg tggacatctt 103140 tgcaggggat attcagtctg ctgtagcttt cttgattgga aggaatagtt tatcatatat 103200 ttgaagtgtt ccctcagtgc ctttggcttt tgttgacccc tctgcagctc tctgcttttt 103260 cttgcccata tttggaaagt gactctaaag cataataagc atcacctatt agggttttta 103320 atgtacaaac caaacagagg tgactttggg aggagaacat ctctgaacta ggtatgagac 103380 attcatcgaa aaaaatccat caagtgttta ttgtatgtct gctttatacc agcactgttc 103440 taggcactga agttcgacca ccaacaaggc agttgtgatg cacttggagc tttcattctg 103500 atgggcattg agatccaggc tgaaggctga gtctgggaat ttgaggaatt ctttgtaggt 103560 cctgggtcta cagagtgaga gctgtcctcg ttccagtttc actgatgacc tctcgaccag 103620 ctccctcaca gcagtctttg ccaacacaga cactgtgggc tgtagtggga ggaatgtggg 103680 gttgaatgag ctaggtttgg gctctgtcct tgactcacca ttgcctcagt gatgtgaaaa 103740 tggtgtctga tccttaaggt tggaagtcca ggcatgcaga tttatctcca tctcaataac 103800 gtggggaaaa aaaagaagtg gtttttgtgg aaccagtggt tccactgtcc acgtgtcttc 103860 cagtgtgcct agcacattcc caccaagtgg atcttggttc atgagggaaa gaagggaaag 103920 tgagggtgcc ccgaggctcc cagaagacaa gtaatgtcac agctgagctg tgtacagatc 103980 gaagaagcag atggataagg agtgaacaaa gtcatccttg tcttgagggg tctttattta 104040 gcttcttctt gactcttaga caaggaccca gaatacagat ggggcttgtt gttaccttca 104100 gcctcatggc tttttagggc tagatacttc agcttgttac atgcagtcct taaagcgtct 104160 gtgtgggttt ttgcaggaga gaacacttgc tcgtgttctc ctgtctggaa cccggacatt 104220 gttggaaagt atcagatttt gtttggcttt gtgtgatttg actgcccacc tgtttacttg 104280 ctttctccca gcaagcagcc gcaatcccca tgggttggta atgggtggaa tgacacactg 104340 tgtagattta ctcttcagac tctatgttca cctcattctt atgggaaaag aagagcacta 104400 gctggtagat atagcagtgt gttaatatga ctgctcaact ccatttacac agacattttc 104460 accttagtta tactatttct tcattaaata ttgttgccag atctaagata caggtttaat 104520 tttttcctct gaattatgtg gtagtagatg tatttaacta tgtttagaaa cagcaaaaaa 104580 tgaagcgttt gaatgcgtta aacacatcta atttgaaagt taatatttag gttgttgatt 104640 tattttttaa aagattagaa tattccttag aaatgtagtc ttataatttc gtatttcaat 104700 aaaaaatatt aaaatgtttc ccagaggaaa tctttactgt catagaaatc accagaaaga 104760 gatagcaatt actcctgggt ggtgatagtc tttgatttgt ggtttacttg ttttcagttt 104820 gaactaaaac atactaatcc cagcctgagt tggattttgc atatggtcaa agtgaagtag 104880 agatttttgt ctactttatg aagatattta aaggacattt gaaatgtttc aatgaacaca 104940 ttgtaacatg cattcccagg aggaaaacaa aattgtgtat tgtgttgaaa atactattcc 105000 aatatatgta tacccacgtc tcattttgtc atagaattcc tgaaaaattt agatgtagat 105060 gagattttta taagttgaaa atattttcag ttgcatttta attgcacaga tgtgttttct 105120 acttctattt ggctgtgact cctaatgcat tgtaatatta tcttccagtg ttcatttcgt 105180 gtagtgataa tggattaaca aatgcattca ttcatttagt aaacatttac tgagccctaa 105240 tcagtgccat gctctggccc agggaggtag tcatgtgcaa ggtccagccc tctctgtttc 105300 attcctggtt ggggaatggg tgggacagac caggaaagga gcagctgccg tatagagcag 105360 gagacgctct gctgggtgtg atacagtccc tgatggggcc tcctactgca gtctttccag 105420 tcaggaaaga aacatcctct aagtggggac ctaaagaatg gatgtcagcc aggtaaagaa 105480 cagggagtag aatgttccag gtagaggaca tagtatatgc attggtccag agaggagagg 105540 gaagagaagg tggcatttag gggaactgta cctgattgag atagttcagc atggagcaga 105600 tatagctgtg gggagtttga tgggagaagg aacgatgaga cgtgatgcct gcagaggtgg 105660 gcaaaggtcg gctgaggcag gtctttcttc acgagccatg tgaggatata gggccttatc 105720 ctcagagtaa agggagccat taaaggctcc acaggctgag tgacttgatg aggtgaatac 105780 gaaacagagc atttctgtga atgtgaatgg cctttggagc agaactaagt tcatgaggat 105840 ggaaagaata taatgaagcc atctcttaga cccagaaaga atggggcaca acagccttta 105900 tctttctggg gccaatgtca cagaatgcca tgcttttagg aaacatggta tgttgtgatt 105960 aacacatttt gcagaagtgg gtgggagctt ttgaataata acagtaagca tttgtgcatt 106020 cttcctgtaa tgacattaca gttatgatct gaaaatattg agtcatacat gaattcctgt 106080 tatcttaact cagaaaatat agtccctcac taaaggtttt attttccttc tttttcccat 106140 ttcctttact tcgtataaga aagtcacttg tctcctgggt gcaatggaga cctatgtgag 106200 ttcatagcca agagaatgtt tttggttaga aaaataatag taggaattcc aagctgtgaa 106260 ttttttactg aagctctttg gaaataggat ttggcaagtt ttgtctgcct tcgtcaagta 106320 agcatgagca ggagagcaca gttaatagca ggtgcagaca catgattctc agaccgtatt 106380 ttgtgttcta gtttcaaggc atgaattctt tcctggggtt aattttattc aaggaagtta 106440 tctgtctgtt agatctgata tgtgctcagg ccaacataga ttctttaccc ttcctttctt 106500 cctgctcacc tgtccttcct cttttatctt ttcattgaat taaaaagaaa attatgaaat 106560 agtttcaaca tgaaaaaagg tacagagaat aacataaaga acactcctgg ctgggtgtgg 106620 tggctcacgc ctgcaatccc agcactttgg gagtctgagg cagccagatc actggaggtc 106680 aggagttgga gaccagcttg gccaacatgg tgaaacactg cctctactga aaatacaaga 106740 attagccagg catggtggcg tgcacttgta atcccagcta cgtgagagac tgaggcagga 106800 gaattgcttg aacccaggag ggggaggttg cagtgagctg agatcacacc actgcactct 106860 agcctgggtg acagagtgag actccgtctt caaacaaaca aacaaacaaa aagaacactc 106920 ctgtaccatc atccatcatt ttgccgtgct gactccaggt tctatttaag aaataaaaca 106980 ttacaggtac agctgatgcc acctctgttt ccctagctca ttcttcagag ataactcttg 107040 tcttgcagtt ggatgtttta atcctctata tcantgtata cttacattct atgtataaca 107100 atatttggta ctggcctaaa tgtgttcaca ttgtataagt gtgcatattg gcctgccact 107160 tcatttggaa ttatgttctt gagatttatc aatgttgata catgtggaat ctggttaatt 107220 tttgccatag tattctattt ttatactaaa cttttaaaaa tccatgcttc tagtctttgg 107280 cttatttttt caggttatgg tatgttttgg atgcacagaa aagtaaaatt aagtcatgag 107340 caaaatatct ggataatcca agctttaaac ttgatgtaga atttgaatca tgtgtgtttt 107400 gttaaccctg tgatgtcaat ccatgcctga ttgtgtaact ccaaccaata ttcctttgaa 107460 aatggaaatt tgtttatatt gactacagat tgccaatatt attagtaaat gctgagcact 107520 taatctcgaa taaagaacta gtttaaaaat gattctaaca atggcattga ctgttctacc 107580 ttattactca tgggtgggtt cagccaatgt ttctgttgga gaccaaaacc aaaaacagtc 107640 aaattaaaca agcagtcaaa cccaacatac agactactga taagaaggtc atatcataag 107700 atatggcatt gaatttgtgt ctgctaatgt aaaaatctga tgcccacagc aaacttaata 107760 aggacctatg tttacatcca tgctcaatta cattcctggg ttaaacagtc atgctttagg 107820 ccctgctgtg tgcctggagt tttgctgaag tgtggggctt ttaagagaag gagaataagc 107880 ttgctccaga gttaagaaat ttaaactaaa agtcctaaag atgttggaaa aactattgcc 107940 cttgaagatg taaattcatt aagttggaga agacctttta tacaaacaac agacccattc 108000 actgatttgt acccttcagg agacagatga ccggtaatgg tgacaatggg tgaatgttgg 108060 gtttggggtt tttagaaaca tctgcacttg gtgactactg tatctaattg gtgtgacaaa 108120 cctggcaccc catgtgtttg gcaccatctt gggtcctact cagggccagg tgaaccgagt 108180 ggcctcttca ctgcttcaga gtctgaagca gattgtagta tgcggaccag acacagaata 108240 taccaccaag cacgttggcg caaagcatac tgggaaggga ggctttgtga acatggtgct 108300 ggttctcaaa cctcagtgct caaaagagtc tcctgaggat tcctggatca cactcttaaa 108360 cttctcattc agtaggtgtt agctggttga gaatctgcat tttttttttt ttttctgaga 108420 ccgagtctca ctctgatgcc caggctggag tgtattggcg ccatcttggc tcactgcaac 108480 ctctgcctcc caggttcaag caattttcct gcctcagcct ctctagtagc tgggattaca 108540 agcacatgcc accatgcctg gctaattttt gtacttttag tagagagggg gtttcgccgt 108600 gttggccagg ctcgtcttga actcctgacc tcaggtgatc cacccacttt ggcttcccaa 108660 agtgttgaga ttacaggtgt gagccaccat gcccagcctg aatctgcata tttaacaagc 108720 accacaggtg attctgatac agtagctccc caaacctcac agtgttagtg aatcccagtc 108780 atttacaatt ctgccatgat tttggtcata ttcaagtgca gctggtagca tttttagtta 108840 atatattttt taaattaagt cacttctttt ggataattaa atttaattac aagggaagct 108900 ataccactgc tgtaaaaaca tcacctgctt taaagagaag gtacataatg aatatacatt 108960 aagataaaga tgtatatgtg tgtgtgtgtg cacatataag tatacacata cctaccatag 109020 ggattgagtt ttccttcagg tttttcaact gaaatgtcaa ctttgaggcc agttaatatg 109080 tgtaagatat atgtgtgtat gtatgtctat acatatagac atatacgtaa aaacatacat 109140 ggatgcatat agtatatcta tacacaacct attatgcata tcatgtatat ttcatccact 109200 tagtattatc ttnttatttt gccgtttggc aaatgctcag taaaagaaaa gggttagaag 109260 gggagaaagg cattttatcc caagccttca ggaatcagga tgaggatgtc ttcaccttgt 109320 ggtggggagt anattataca attagagaca gcacattgga gntgtggctg atatgctgtg 109380 tgatgatagc tctagctctc tgcctagcag aggaaggaca tttcaataga agaaaaagtt 109440 taagaccttg ccgagaaaca gagaaaggat gtttgtcttt ttaagaagtt gaaaaccctg 109500 tttgcagaca aaagccctcc agttttggca gtaaactttc atgcaaggga agaaaaaggc 109560 aggggatgac attgttgaca attgtgagga attaccatgt gccaggcact gtgcgagggg 109620 ctttgtacat atcctctagt tttagtgctt ataaaaactc tgtgatatgt gcacagcatt 109680 ttaaactttg ctgcatagtc gagaaaatgg aaggatgggg aatttgagtc atttgcccag 109740 ggttctatag ctaccccagg ttcccatgac tggagaattg gggcacaggg tggcggggga 109800 gagtgagtga caagaatcct aacaatctta tttccattga gtccttataa aagaagtgga 109860 ttaactacca cgtttttaag tttttcttaa atttaggtta tgtggatctg gcgtttcttg 109920 ttttgtcctg ggtttgtttt gtttttgcta tgctgtcttg aacatctgtc atcttgtagg 109980 cctaacggta aacacaaaaa cactttacct cctatagctt tcaattaaga tctctcagtt 110040 tgtgtttgta atagttttcc aggcaagttc tccctaggtt cggcttctag tgtgttaacc 110100 tttagttata aagtgaaccc aaagagagaa agtagaaaca aaacacctca cctgtttttg 110160 ctcatgaatt actctctatg gaaggaacaa tcatgaacac ctctgcgtat cacagaggcc 110220 tatctgagtc tgacgtttaa gggagaccgc gtaggtccct ttgaggactg tgaatgtggg 110280 agtcctggga ctctggtgaa gaacccgttc cagaagagat gaatgagctg gacaagttct 110340 ttcatagaac ctttaggcag gttttcttag aaatgcacat tgaggattat gcttggatat 110400 tgtgatgatc agaatgatac tcaatccctt ctgcatttgg aattctcttt gaaagaaaac 110460 atcccaggca gctatttctc agagatagtg agtcccagcc acttctagac attttcttgt 110520 gtagtctaca ttataatttc acagcagtct ctgatatgac aaatgtcaaa atagcccaac 110580 cttctctaaa cttcagagat gtctgatatg atattgaata aaacaatgct catagaaaca 110640 tcaagaaagg tggattttcc ctggatactt ttttcctgct tgacaaataa cagtgaagaa 110700 actgatctca cgtctttttc tctttggaag cctgaacact cagaacccaa cttgaggctc 110760 ctcagctata gcaattctga cttcacagtc tgtaaattat tgttcttttt tttctttagc 110820 ttatgctttc tgccctaatt tatcttttcc ctgttctaat gaattattgt cctatatctg 110880 ctgtgcagtt aggtgacata taacagcaat taaatatatg aattggtaca tataaagatt 110940 tgactaaaac tcgatgtaaa aataagtgtt ctacattcaa tttccagtgt tagaaacagt 111000 gctgacttga acagagtgac agaattccat ctttccctat ttttgacagc tttaaacttt 111060 atattttctt cctttcttgt gagccgtcat taactttgtt tctcaaagnc cattcccgta 111120 ttacccatct tgcagacgca gacagatttg ggaatttgcg gtcagagttg tattggacac 111180 atccccccag cccacatgag atccttttaa tctattgcat attaactagt tttaagtaca 111240 atattcctac ttcatttaaa accattaatc aaagaatgag tttgaaaatg aacaaaatgc 111300 aaacttacag ttagaaataa ttgtagtgtc tttagttttg gttaggagtc ggtttcttgt 111360 ttgttaaact caagattgtg aacagtttta attcacttgt ttatttccaa tagagatttc 111420 aggtttacat ttgaattcag aaacaaagtt ttctttctca ttacagagaa cactaaactc 111480 tacatctccc ttcccgagca aggagctggc tgaagccaca aaaacattgc tgcatagtct 111540 tgggactctg gcccaggagg taagttgtgt ctttccagta ccaggaagcg gatcatccac 111600 tgtatcagta ttttcattcc tgagtctggc aagaggtcct tttgagttga atatcacatg 111660 ggatgtaata tcaattttca aagtataagt gatgtaaaca ataatgtttt gatttcctta 111720 ttttagaaat gaagaaacct aaaactcata gatgtctcag agctaattgg ttagtggcta 111780 acagctggat atctagttta gaaccttctc cattttttct ttttgcccct aggtaatcat 111840 acatttgtaa agaggagaat tatctctgcc actgcccatg cactgctttt gtctgaccag 111900 caatttctcc atattgcttc ttcagtagca aggccaatca ttttaccaac acacatgctt 111960 gctaactaac aggaataacg tggtacccct aattcagccc tttcccttga aagcatctgg 112020 cttctgaggt tcaactatgg gaatatggtc tcttaatgaa cattaagttg agtttgcctt 112080 ttaggtccac atgttgacaa atgtatcaga gtaatctctg tcctaggatc agagggcctg 112140 taggcacttg caaaagcagt tagctctgac tcccagccag tgcacactcc acctttctga 112200 ctcccagcct tgtctcaaat taggcttgga agcgaggaac tgtctggtgt cccccagcat 112260 aggaagctga gccagggggc agtgctcaca aacaatacag actttaacgt gtaggatatt 112320 ggaaaataat aatttgtggg gaaattgtct cagacttggt ccacccttat ttttagctgc 112380 ttctctaatc cgtttttctt tttttggtgc ttgtatctaa cctacccatt ttttggtgct 112440 tgcatcattt tttcaaatat caaaaacgaa ctttatgttt tctaacaatg aaagtattgc 112500 atgttcattg tggaaaatgc tgaagacttg gaaaatacaa aaatgctgag atcaaacact 112560 attgatacgt tagtgtattt cttcctgtcc tgttctactt tctttctttg aattctgctc 112620 acgtgtttct gactgatgag gtctgacttt tgggttcctt ttccagagga gaagccttct 112680 ttcagcttgc catttgttac cctggttatg aaggctggta acctttttta ctaggtagag 112740 aagctggacc aactggggtt cttccagggg gagaatgaga aagagaaact gttttgcaag 112800 tccgtagcta tttctctagg gccctgttag ctgacattga catgccttgc attgctctgc 112860 agatcccctc gcagccctct gtcccttgtt catttctggc cttagagaaa gcaaagcagg 112920 gtctgtaaca ggggaggctg cctctaaact cagggtttgg ttacagctgt tttcacttac 112980 atcactggcc ctggtttttt ttttttttct ggcattaaaa aaaaaaattg gaagcaggtg 113040 atgttcccat tgctgatgtg gtggaaactc tccaagtgaa caatatacgt ttttcttggc 113100 agctgtttct tgtgccctgc ttgctcctgg tccaggacaa gcaaggacca tctgcctctt 113160 tcaatagaac acctccagat ccctttgatc aaaagttact cattgtctga cttgctattt 113220 ctgtgagata aatgggagaa gatcaataaa tgcacttgtt tgtccagtca gcngtgtgga 113280 aagttgataa ttttgaccaa agcacaaccc ttgaaaggaa aagaaaaagg gagtgaatgt 113340 cttctgagaa gctgcctagg ttcagacagt gtcacccatt tccctgtatg ctccacatga 113400 caaacctgag tgggtctcat catgtccatt ttgcagatgg caccaaggct cagaaaggtt 113460 aggcaacttt tccagtcacc caatgagtta attgacaaaa ctgggattca aacccagaac 113520 tgttggattc caaagcctgt gttgttgcct gcttcgtgaa aaactccagt agcgactgga 113580 atagaaagga gaaccttcca agaaagaaaa tacgcactag cagaacctgg aaattgggag 113640 gaaatgagga cttgaggaat aagatgaatg aaagctgacc tgagtttcac atctgggtga 113700 tgggaaggga ggacagggag gcagcatctc agatgtccac ccagcaccga ccagctgcct 113760 ggcattgcta ggtgttgagg actcagcagt gaacacgcta acttctctgc tttcttgggg 113820 cacgtatagg gtgagagaca gaaacaaaca ggtcagtgta caatgccaca ggagggatat 113880 atgcagtgaa gaaaaagcag ggtaaggggc atagagcatg agaaggtgct ttttttaaag 113940 gggktgatta ggaaagctct ctctaaggtg acagttggac ctgaaggaga tgatagcatg 114000 tctgtggtga gggaaggaaa ctccgaacag gaagaatggc agatacaaag acattgatgc 114060 tagagcatgc ctaaggaatg tgtttaagga ccagggaaag tgagcaagtg gtggggggag 114120 gagaggagct cagagcagga ggaggtgagt gccatacagg cctggcaaga ctttggattc 114180 ctgctgggtg agatgagaat ccagcggagg gcttgaggga ggggacatga tgtgatctag 114240 agtttagact gtttacactc tggttgttgg gttgagaaga gactgggatg ggggaaaggg 114300 aggacaaagg acattgtgct ggattgagaa agcagtaagt cagtttcatt cattcactca 114360 accgatgatg ttcaaatacc accatcatcc gtgggctaaa ggatgaagag ccatccctcc 114420 ctgagagtca ggaagcactt cccagataaa gtttggagtg tgagctgagg tgtaggagaa 114480 agagtaagag tttacccctg aaacgggtgc tgggaagagt caatagtttg gaataactca 114540 ataatttatg gtgcttcttt agaaagattt gctggcttta tgtgggaaga aatttktttt 114600 tttgattggg gagtggtggg ttggtggtga ggctgcctgt ggaaagagaa gtgagtgttt 114660 tgactcactg ttatttaaaa atctctaggg ctgttccaat aagcaacaaa aggcaaaatg 114720 gcctggttct ctgtcccctt tctgtctgta tgcctcgtac aggttatgaa aagaaaaagt 114780 tgggaaaagc tgtccacctc acctaattgt gttcttgtgg agtgtgctag atgccccctc 114840 tctggagaaa aaaaatcctt gtggcctctg acccacctct ggagagccta gttcccttct 114900 ggaggcagaa ggcaaagctt aggacctaga gagtgctgga ccacgccact cacaggaacc 114960 agcaggctgt gaggttgaaa gctaggcata tggagctttc caggctgggt gcagggcctc 115020 gtggcccttc ccctcccctc tgtgctctat agctcagtct tcccaggcgg tgtgaacacg 115080 cagtgacatt tccaggaata cagggattta ttaatgattt cttgtgaaat gtttggaaat 115140 acaaagtact ctataaatat ttcataatag cattggggct gagaactcca caaagtgccg 115200 gaatacattt gcatgtaaga cagaacgctg cctgggtcat tgatgcctgt tgagtggcag 115260 tcacagacac tgcctagggt ttctgactca cgctgttggg actgttctat gcagggcacc 115320 ctcttgtgtg gcataggatt tgtgcctcac cacacactgt tgtagctttg ctgtcttgat 115380 gatgagtaga gggcagtgtc caggccatgg tataagcatc tactgccccc cagggttacc 115440 aaaaccaagc caagttgtgt ctcagcgagc tccgtgaagc atggagaagt tgagtactca 115500 gagacatgac gtgacttttc aaaggctgta agctgacgag ggacatagct agggttcaga 115560 cttgagtttt tctttttctt tttctttttc tttttttttt aagactgagt cttgcttttg 115620 tcgcccaggc tggattgcag tggtgcttgg ctcactgcaa cctctgcctc ccgggttcaa 115680 gcaattctcc tgcctcagcc tccccagtag ctgggattac aggcacctgc caccatgcct 115740 ggccaacatt tttgtatttt tttagtagag atggggtttc accatgttgg ccaggctggt 115800 cttgaactcc tgacctcagg tgatccaccc gcctcgacct cccaaagtac tgggattaca 115860 ggtgtgagcc actgcacccg gcccagactc gagtttttca tcttaatgct ttttcattgc 115920 ctgacacttt actgagacca agatagggaa cttcacatac agtacctttt ctcccaaggc 115980 ggaagagggc tgttcaattt ctacactaga gttcggggag ttttagaaat gagtcagtta 116040 tcgaggatga gagcagttcc tgataggctc aaccacaatg agatgtagct gttcagagaa 116100 agcattcttt tatctataaa ctggaagata atcccggtga aacgaagccc agccccaggg 116160 gcttcactaa ctccaggctg tgcttctcaa actttagtga gcataggaat cacctgggca 116220 tcttgtgaag ctgtagattt gaattctgca ggtcggcaga ggggtctcag aatccgcatt 116280 tccaacaatg tctccagtaa tgctgatgct gctcgtccct ggaccacaga ttgggtagcc 116340 aggttctggc aagctcatcc caaggctttg agatgacatc agacaaaata tgttctggga 116400 catggctttt gagaggtcaa gaaaataaga tgtttctttc tcttctcatc cccaaccctt 116460 gcactgccct tttctccctt cccctaccct cctttctgtc cccatccctg acgccagctg 116520 ttcagcatga gaagctggag tgacatgcga caggaggtga tgtttctgac caatgtgaac 116580 agctccagct cctccaccca aatctaccag gctgtgtctc gtattgtctg cgggcatccc 116640 gagggagggg ggctgaagat caagtctctc aactggtatg aggacaacaa ctacaaagcc 116700 ctctttggag gcaatggcac tgaggaagat gctgaaacct tctatgacaa ctctacaagt 116760 gagtgtccat gcagacccca gccctgtccc caaccccatc cctcccttag ttctggcctt 116820 ggcctgtgtc atctcctccc tctgtagcag cgttagatgt ctacatgccc atttgcccac 116880 cagactgagc tcttcctaga ggagagaggc ttctcttgaa tagctacctg tccccagttc 116940 tctgaatgca gcctggcaca tctcaggtgc acagtagtgt ttatcaatgg aatgaatgat 117000 tgacagccaa ccttctggtt ttctggggga tgtggaaggg tggcttccag ggtgatcaag 117060 aatgagataa tggcagaagg acaaatcctg caagatctca cttatatatg gaatatatgt 117120 aaggtagaaa gtgtcagttt cacatgatga ataagttcct gggatcttga tgtacatcgt 117180 gatgactata gttagtaaca ctgtatagta tacttgaaat ttgctaagag agtagatccg 117240 aagtgttcac actacacaaa aaaggcaact atgaggtgat ggatttatta acagcttgat 117300 tgtggtgatc cttttacaaa gtatacatat attaaaacat cacattgtat accttaaata 117360 tatacaattt ttatttgtca gttgtaactc aaaaaagcta gaaaagcatt tttaaaaagg 117420 atgatgtact ggtcttaata ttaccattga gataagcttt ataataacat aaaaagaaat 117480 aacagtaatg ataatagcaa caacaacaac aacaaagaac taacatttaa gtagaatttc 117540 ttgtgcactg tgcattctgt ttaagttatc tcattttacc ctcatgataa ccctgcaggg 117600 aagattcttt aaccccacat ttcataggct cagagaggtt aagtgccttg gttagagcca 117660 catcagagtt aatccacaag agccaggatt caagcccaaa tctgcctgga tctgtgctct 117720 ctaagataac tgttagtggt ggcgtgtgtg ttctcacact cagacatttg atctgccctt 117780 tgtttcccat tcttagctgc aaggcagtgt taaagaaccc tgtgtctcca tatccactcc 117840 ccacacttaa gcacttttgt gggcccgtgt gccgtatgcc tcgtggcagc agggatccaa 117900 tgtcacagtt ttaggcagtg gcatcctttt ccttgaaaac ttgatgcagg ggaacctttc 117960 tccatttcca accacaggtg tgtcnntttc agacactgag tgnaggcagg ttttgtactt 118020 tattgtaaca caagaacctt ttcttctctg gagtaaagca ctccagacat tcgcaagttg 118080 ctttacaagc cttaaaagga tggtattgta ggcaacttta attaaatccc atctcctcct 118140 ctcccccagc ttgcaagttg acccaaggaa gccttcattt ccatgacaga cttaattgtg 118200 agggcatcct cattaaaaaa aaaaaaattc tattatcttt ccagcatata gaagatactt 118260 ggtatctaaa aatccctgaa aaacttagaa tgaattttta aaaatcaggg atcctgctgg 118320 ataaccaaac ccatttgtct gttacaactt ttgtatttgg gtttttgtta agtgtacata 118380 tactagtttg tgttaattaa agagaatttt tttttttttt tttgagaggg agtctcgctc 118440 cgttgcccag gctggagtgc agtggcgcca tctcggctca ctgcaagctc cgcctcccgg 118500 gttcacgcca ttctcctgcc tcagcctcct gagtagctgg aactgcaggt gcccgccacc 118560 atgcccagct aatttttttt ttttttgtat tttgagtaga gacggggttt cactgtgtta 118620 gccaggatgg tctcgatctc ctgacctcgt gatccgcctg tctcggcctc ccaaagtgct 118680 gggattacag gcgtgaacca ctgcgccctg ttgagaattt tttttttttt ttttgggaga 118740 cagagtttcg ctcttgttgc ccgggctaga gtgcagtgac acaatctcgg ctcactgcaa 118800 cctctgcctc ctgggttcaa gcgattctcc tgcctcagcc tcatgcgtca ccacgcccag 118860 ctaattttgt atttttagta gagacagggt ttctccatgt tggtcaggct ggtctcgaac 118920 tcccaacctc aggtggttcg cccgccttgg cctcccaaag tgctgggatt gcaggcatga 118980 gccactgcgc ccagccccaa attttggttt ttgcttgaaa actgaggtct gaattcagcc 119040 ttctggttgc ccctcaagag tcagtttaaa tgttggtcat gttagttgtc agtgaaaaca 119100 atggtgaggc tggcatgaga gtgtgaatct ggatgggagg gcttgtgctt catgaaaaca 119160 tttttccaga tcagctcagt cgtgagttat ccgtcattga cgttataata agctctgatt 119220 atttatcaag catcattctt tatagatatc tcagtttaat ctgagataat cttctccaca 119280 tctctccaca tagatgttat gaattttact tttacagagg agccaactga ggctcagata 119340 agttacttat tatatgacta gtagtggtag agctggggtt tcaactaaga actctctggc 119400 tccaaagccc ttgtaagttt ctatcagtat atgaccatgc atatgagcat ttgtctctcc 119460 tcttcttcat agctccttac tgcaatgatt tgatgaagaa tttggagtct agtcctcttt 119520 cccgcattat ctggaaagct ctgaagccgc tgctcgttgg gaagatcctg tatacacctg 119580 acactccagc cacaaggcag gtcatggctg aggtaagctg cccccagccc aagactccct 119640 ccccagaatc tccccagaac tgggggcaaa aaactcaagg tagcttcaga ggtgtgcgct 119700 aagtatactc acggctcttc tggaattccc agagtgaaaa cctcaagtct gatgcagacc 119760 agagctgggc cagctcccca gtcgtgggta tagaatcata gttacaagca ggcatttctt 119820 ggggatgggg aggactggca cagggctgct gtgatggggt atcttttcag ggaggagcca 119880 aacgctcatt gtctgtgctt ctcctccttt ttctgcggtc cctggctccc cacctgactc 119940 caggtgaaca agaccttcca ggaactggct gtgttccatg atctggaagg catgtgggag 120000 gaactcagcc ccaagatctg gaccttcatg gagaacagcc aagaaatgga ccttgtccgg 120060 gtgagtgtcc ctcccattat taccatgtgc ctgcttgata ctggagaggt gagtttctgg 120120 tcactttccc aggtgtgagt gaggtgagaa ttctttcagt ttatctagct gggggaatgt 120180 agtgagcata gctaaagtca cagggcacca cctctccaga agtacaggcc atggtgcaga 120240 gataacgctg tgcatatcag catccatgcc actcacggtc aaatagcagt tttctgcaaa 120300 acttagtgag ggctggtgtt tggaagtgga gttgagtaat tgcagtaccc tattttcctt 120360 tttngctgca gcctctcagc cagccacagc atctccctgt gtcttggtag gttttggaaa 120420 gaagtgtggg agcaaaagca tgatgttaca tgtagactgg cctgagatac tcattctcag 120480 ggcactgtgt gaatgatgag ctgctgttac tgtgtggagg ggaaatgcac ttagtgcttc 120540 agagccactt gaaagggata agtgctctag agacaattgg gttcaaatgt ggagcaggct 120600 gagcaagaac agaatgtctc ctttgcctga gcctgagtgc tgttaatcac atcttcctgc 120660 cttgggctga gttagagaat cattagacta tttcctgttt ccatggtgag ggaggcctct 120720 tccttttgtc tctgctcccc ttaagaagca ggtgaggatt ttgccaggtt tcttgttttg 120780 aaccttattg actttaaggg cggctgggtt ttagagactg tacctaccta gggggaacac 120840 ttccgaagtt taggactatt ccctgatccg ctgggaggca ggttactgag gaagtccctt 120900 taaaaacaaa ggagtttata ctgagaaaag cataaacagt gatttgtatg gattcacact 120960 gactaatata gctcatgcca ttaaagtggg gtctcttctc taaaggaggg ttatatgatc 121020 tagccccgta gacctaagtg tggtttcaga cctgttcttc ctggtcctct ccttggaatc 121080 catatttcta ctagttggac tttttctgtt tgtctggctc tcagaggatt ataggaggcc 121140 ctgtgaagtg actcagtgaa ttttgatttg tgggcaagta gatggttccc tagtctgaaa 121200 ttgactttgc cttaggtgct tcaattcttc ataagctccc agttcttaaa ggacaagatc 121260 cttgtaaaca tggcaatggc attcattagg aatctagctg ggaaaatcca gtgtgtatgc 121320 ttggaaatga gggatctggg gctggagaga aaggcatggg catgccttgg agggacttgt 121380 gtgtcaagct gaggaccttt actttaagct ctaggggacc aggcaagggg agatgtagat 121440 acgttactct gatggggtgg atgaattgaa gaaggatgag gcaagaatga aggcagagac 121500 cagggaggag gctctccaag tggccaaggc ataaagcaag aaatgaggcc tggtgactgc 121560 ttagtggcag agcagtgaaa gagagggagg catcaaagtg agtctcgatt tctagctggg 121620 tgggtggtag cgatgtccag taggccagtg gctactgagg tctgcagtgg aggagggtgg 121680 ttgggctgga gacagatgat gagggagtca tcagcctgtg ggtggaagaa aagggaacct 121740 cttccaactg ttttctttgc ttcttccctc tctttctctt tttttttttt tttggacaga 121800 gtcttgctct gtcacccagg ctgaaatgca gtggcatgat cttggctcac cacagcctcc 121860 gcctcctggg ttcaagcaat tctcctgtct cagcctccag agtagctggg attacaggca 121920 catatcactg tgcccggcta atttttgtat tttcagtgga gatgggattt caccatgttg 121980 gtcgggctgg aatgaactcc tgacctcaag tgatccacct gcctcagcct cccaaagtgt 122040 tgggattaca ggcatgagcc accgcgcccg gcctttcttc cctctcttaa agagtgttta 122100 tttaattcca caaacatgag cttgtcaccc cctgtagcct ggcatctcct acacgaggtg 122160 atggctgagg cttctgcttc tgctggggta gctctgatct ttctgctttc tctggcactg 122220 tctacccatg ttgcctcacc ccacaggtcc cagggcacct ctctcgggca agtcttggaa 122280 ccctctgaca ctgatttgct ctcttttctg agctgctttt agccacccat cctcgggacc 122340 tgttttctct ctgcctccac ccctgcgggc agtcttaggt ctcctgcccc tcacgagcac 122400 cccagagagg ccacgtgctc agtgatctca gtgggcgcat ctttctagtc ttgctattct 122460 ttttggccat gttgttcaga aaccatactg ggcagggccg acttcaccct aaaggctgcg 122520 tctcttcact ctgcttttgt ttgttccaaa taaagtggct tcagaattgc taaccctagc 122580 ctctgtgaac ttgtgaggta caattttgtg tctgttatgt taacaaaaat acatacatac 122640 cttcctggtg atggtataaa ttgctattct ctattggaaa gcaatttgga atgaaaattt 122700 aaagaaccat tttaaaatat gctatcctgc gtacctccat tccacccacc cccagggatg 122760 tagcctactg aaataatttt aaagaagtca ccatatgaga gaaaatgtta ttgctatatt 122820 gttattgtga gaaattggaa atagactaaa tgttcagcac tataggaata attaatgaaa 122880 ttacatatac tctatacaat cattatgctg ccattgaaat aataaataca aaggcgcaag 122940 gggggaaaag cttataatgt tagtgaaact aagactgatt tttttataaa gcagcagttt 123000 tcagaccctt ggagactcca attcggtaga accagagctt catcttctct gtcgaagctg 123060 tgacaggagt tgcaaatgcc tctccttttt gctgagtttg cagctgctgt ttttccggca 123120 gcacatctgt gcaggcctct gcctcggccc ctctggnatc tgctgattga gcagcggatt 123180 gatctgtcct tctctttcgt gttgacccat gtgaggaacc aactggcaag gganacaaga 123240 aatggaaata ggcctccttt gcatcatgac ctgtacatcc tgcaattgga aaagattgta 123300 ctttagttgg tttaaccagc agcattattt ttctaaacta agcagtaaga aggaattagg 123360 ttttatgtgg gatcaacaga ctgggtctca aaagaggaag gtgatagaac acagtgggga 123420 gggggaggtg cactagaaac agagggccta tgctttcatt ctggctttgc tacttaatag 123480 ctgtgtgacc caatcttaga gacttaacct ctctgaactt ccattttctc atgtataaaa 123540 tgggaaatat taaaggatac tcactgggct ggtggcttgt gcctgtaatc ccagcacttg 123600 gggaggttga ggtgggagga tcacttgagc ccaggtgttc aagaccagcc caggcaacat 123660 ggcaagactc tgtctctatg aaaaaattaa aaattagcca ggtgtggtgg tgtgcacctg 123720 tagtcttagc tacttggtag gctgagatgg gaggatcact tgggcttggg aggtcaaggc 123780 tgcggtgagc tgtgattcca tcactgcact ccagcccggg cggcagagcg agacactgaa 123840 tccaaacgac aacaacaaca aaaggcaaaa aaataaaagt gccctcttta tggagttgtg 123900 taaggtgaag catatacact attcaacata gtaactatat aaaggaagta ttgttgttgt 123960 tactgtagtt aataccatta agtgagatgt ttcgtatagt ggaaagcaca tggactctga 124020 attcagactg gtctgacttt gagtctcagc tccacatcta gtaatactat gaccaagccc 124080 tggttaaaat catgtttttt tttcttcagc ctcagtcttc tcacatataa aatagggaca 124140 ctgtcattta cctcagtttt ctgtgaggat aaaacaacga cagtgtatat gcaagtattt 124200 tgtaaatttt gtagtgctcc tcaagattta gttggtgttt actacttgta ctttctcact 124260 ggaatggcag atgctgttgg acagcaggga caatgaccac ttttgggaac agcagttgga 124320 tggcttagat tggacagccc aagacatcgt ggcgtttttg gccaagcacc cagaggatgt 124380 ccagtccagt aatggttctg tgtacacctg gagagaagct ttcaacgaga ctaaccaggc 124440 aatccggacc atatctcgct tcatggaggt gaatctgttg ctgggatcat ttagaaaaga 124500 cttaacggct tctttctctg agacgttaca ataaggttca ggcaggaggc aagtttagaa 124560 ataatgtata gtctcattta caaaactatc cctcaagcct aacacaggat ttgataacaa 124620 aaggcactta ataaatgtta gttgagtggt tgaatgagta aataaactct agctttagta 124680 aattaactct agcttattct atataggctc aagagaatat ttctacccat tttcttctag 124740 gttttcctat ctcagtgact aatggtagca aagcattccc ttaaaaaggc attatttgtg 124800 aaacttatct aaaatcgaat tcgggtccaa ttaaattttt gaaattttat attaaaaatt 124860 atattagtag ggatgggtaa gaggtgtttt ggtctggttg gttggttagt tgctatgact 124920 cagaattgct aagaaaacag aaaagtaaga taagatcatt gttttaacct cttttcctcc 124980 acaaaatcaa taaataacat atccctaaat tactcttaga atttctctta aattgcagtg 125040 aaaaaccaaa atccttcatt cttggttgaa ggttggaaaa ctacgttaga gaggattaga 125100 gagagaggat gagcaatcgt gtagtcagcc cttgcctcct agtgtaggat ttgtctcagc 125160 cactgcttgt tgtcctggct gccaacgttc tcatgaaggc tgttcttcta tcagtgtgtc 125220 aacctgaaca agctagaacc catagcaaca gaagtctggc tcatcaacaa gtccatggag 125280 ctgctggatg agaggaagtt ctgggctggt attgtgttca ctggaattac tccaggcagc 125340 attgagctgc cccatcatgt caagtacaag atccgaatgg acattgacaa tgtggagagg 125400 acaaataaaa tcaaggatgg gtaagtggaa tcccatcaca ccagcctggt cttggggagg 125460 tccagagcac ctattatatt aggacaagag gtactttatt ttaactaaaa atttggtaga 125520 aatttcaaca acaacaaaaa aactcaactt ggtgtcatga ttttggtgaa attggtacat 125580 gacttgctgg aaggtttttc ataggtcata aaataacagt atcttttgat ttagcatttc 125640 tactcaaggg aattaattcc aggaattttg gtggcaggca cctgtaatcc cagctactcg 125700 ggaggctgag gcaggagaat tgcttgaacc caggaggcag aggttgcagt gagctaagat 125760 cgcatcattg cactcccgcc tgggcaataa gagtgaaact ccatctcaaa aaaaaaaaaa 125820 gatacaaaaa tagaaaaagg ggcttggtaa gggtagtagg gttttgggca attttttttt 125880 tttttttttt ttttattgta tggttctaaa ggaatggttg attacctgtg gtttggtttt 125940 aggtactggg accctggtcc tcgagctgac ccctttgagg acatgcggta cgtctggggg 126000 ggcttcgcct acttgcagga tgtggtggag caggcaatca tcagggtgct gacgggcacc 126060 gagaagaaaa ctggtgtcta tatgcaacag atgccctatc cctgttacgt tgatgacatg 126120 taagttacct gcaagccact gtttttaacc agtttatact gtgccagatg ggggtgtata 126180 tatgtgtgtg catgtgcatg catgtgtgaa tgatctggaa ataagatgcc agatgtaagt 126240 tgtcaacagt tgcagccaca tgacagacat agatatatgt gcacacacta gtaaacctct 126300 ttccttctca tccatggttg ccacttttat ctttttattt ttattttttt ttttgagatg 126360 gagtctcgct ctgacgccca ggctggagtg cagtggctcg atctcggctc actgcaacct 126420 ttgcctcccg ggttcaagct attctcctgc ctcagcctcc acagtagctg ggactacagg 126480 ctcatgctgc cacgcccggc tgactttttg tattttagta gagacgaggt ttcaccatgt 126540 tacccaggct agacttcaac tcctgagctc aggcaatcca ccctccttgg cctcccaaag 126600 tgctgggatt acaggtgtga gccactgcac ccagcccacc actttaattt tttacactct 126660 acccttttgg tcaaaatttg ctcaatctgc aagcttaaaa tgtgtcatga caaacacatg 126720 caagcacata ctcacacata gatgcagaaa cagcgtctaa acttataaaa gcacagttta 126780 tgtaaatgtg tgcacttctt ctccctaggt ggtaaaccac atttcaaaac aacccaaata 126840 aaactgaaca aagcttcttc ctcttagact ttttagaaaa tctttcagtg ctgagtcact 126900 aagctgccaa gttctcattg tgggaactat gcctttggat gtaatgattt cttctaagac 126960 aatgggcgga ggtgtagtta ttgcagacat ctgaaatatg taatgtttct tccagattct 127020 ggaaattctc ttattctctg tggttggtgg tggtggtggg atgtgtgtgt gtgtgtgtgt 127080 gtgtgtgtgt gtgtgtgtgt agggatcagg atgcgggagg agctgggttc tgcttgtatt 127140 ggttctctgt tttgcattga atagtgtgtt tccttgtatg gctatctata gcttttcaag 127200 gtcaccagaa attatcctgt ttttcacctt ctaaacaatt agctggaatt tttcaaagga 127260 agacttttac aaagacccct aagctaaggt ttactctaga aaggatgtct taagacaggg 127320 cacaggagtt cagaggcatt aagagctggt gcctgttgtc atgtagtgag tatgtgccta 127380 catggtaaag ctttgacgtg aacctcaagt tcagggtcca aaatctgtgt gcctttttac 127440 tttgcacatc tgcattttct attctagctt ggaatctgaa acattgacaa gagctgcctg 127500 aaatgtatgt ctgtggtgtg attagagtta cgataagcaa gtcaatagtg agatgacctt 127560 ggagatgttg aacttttgtg agagaatgag ttgttttttt gttttggttt ttagtacttt 127620 aacataatct acctttagtt taagtatcgc tcacagttac ctagttactg aagcaagccc 127680 ccaaagaaat ttggtttggc aacactttgt tagcctcgtt tttctctcta cattgcattg 127740 ctcgtgaagc attggatcat acgtacattt cagagtctag agggcctgtc cttctgtggc 127800 ccagatgtgg tgctccctct agcatgcagg ctcagaggcc ttggcccatc accctggctc 127860 acgtgtgtct ttctttctcc ccttgtcctt ccttggggcc tccagctttc tgcgggtgat 127920 gagccggtca atgcccctct tcatgacgct ggcctggatt tactcagtgg ctgtgatcat 127980 caagggcatc gtgtatgaga aggaggcacg gctgaaagag accatgcgga tcatgggcct 128040 ggacaacagc atcctctggt ttagctggtt cattagtagc ctcattcctc ttcttgtgag 128100 cgctggcctg ctagtggtca tcctgaaggt aaggcagcct cactcgctct tccctgccag 128160 gaaactccga aatagctcaa cacgggctaa gggaggagaa gaagaaaaaa aatccaagcc 128220 tctggtagag aaggggtcat acctgtcatt tcctgcaatt tcatccattt atagttgggg 128280 aaagtgaggc ccagagaggg gcagtgactt gcccaaggtc aacccagccg ggtagcagct 128340 aagtaggatg agagtgcagg gttcatgctt tccagataac cacatgctca actgtgccat 128400 gctgtctcat tggtagtggt tcatggcagc atctgaaagc tatttatttt cttagatata 128460 ttgggtggcg attcttccta agtttctaag aacaataatc agaaggatat atattgttgc 128520 aggttagact gtctggaagc agaggctgaa atagagtttg atgtatgggt atttatgagg 128580 gctcaatacc tatggaagag atatggaaga tgcaggattg ggcagaggga ggagttgaac 128640 tgtgatatag ggccaacccc gtggggcact ctagagaata tgcagcttgt tggagttgtt 128700 cttcatcgag ctgaaacatc cagccctttg tgctccccca aggcctccct cctgacacca 128760 cctacctcag ccctctcaat caatcactgg atgtgggctg ccctgggaag gtcgtgcccc 128820 agggcctaca tggctctctg ctgctgtgac aaacccagag ttgctgatgc ctgaggccgt 128880 ctactgacag ctgggcaaca aggcttccct gaatggggac tctgggcagt gcagttttgt 128940 gtctgaacca tacattaata tatttatatc cgaattttct ttctctgcaa gcatttcata 129000 taaagacaca tcaggtaaaa ataaatgttt ttgaagcaaa aggagtacaa agagataaga 129060 actaactaat ttaatactag ttaccatctg ttacaaatag ttcctactga ttgccaagga 129120 ctgtttaaac acatcacatg ggcttcttct tctatcctca ctaacccttt taacagacaa 129180 ggaaatgagg ctcaggaagg tcaaggactt tattgaggtt ccacagtagg atacagttct 129240 tgctaaaagc aacccctccc tcatgctctg ttatctaact gcaaggggaa ggtcagtggc 129300 agaggtagtg gtcccatggt tggtgcataa gagctgctct gagacaactg catgctggtg 129360 ggtcctgcag acatgtaccc atcagccgga gataggctca aaatatccac aagagtttgg 129420 atgattgtgg gaatgcagaa tccatggtga tcaagaggga aagtcaagtt gcctggccat 129480 tttccttggc ttttagacag aaaagttacg tgggatatta tctcccacag ctcttctgtg 129540 gtgccaccag tcatagtcct tatataagga gaaaccagtt gaaattacct attgaagaaa 129600 caaagagcaa actcgcccac tgaaatgcgt agaaagccct ggactctgtt gtattcataa 129660 ctctgccatt atttttctgc gtagttttgg gtaagtcact tatcttcttt aggatggtaa 129720 tgatcagttg cctcatcaga aagatgaaca gcattacgcc tctgcattgt ctctaacatg 129780 agtaggaata aaccctgtct tttttctgta gatcatacaa gtgagtgctt gggattgttg 129840 aggcagcaca tttgatgtgt ctcttccttc ccagttagga aacctgctgc cctacagtga 129900 tcccagcgtg gtgtttgtct tcctgtccgt gtttgctgtg gtgacaatcc tgcagtgctt 129960 cctgattagc acactcttct ccagagccaa cctggcagca gcctgtgggg gcatcatcta 130020 cttcacgctg tacctgccct acgtcctgtg tgtggcatgg caggactacg tgggcttcac 130080 actcaagatc ttcgctgtga gtacctctgg cctttcttca gtggctgtag gcatttgacc 130140 ttcctttgga gtccctgaat aaaagcagca agttgagaac agaagatgat tgtcttttcc 130200 aatgggacat gaaccttnag ctctagattc taagctcttt aagggtaagg gcaagcattg 130260 tgttttatta aattgtttac ctttagtctt ctcagtgaat cctggttgaa ttgaattgaa 130320 tggaattttt ccgagagcca gactgcatct tgaactgggc tggggataaa tggcattgag 130380 gaatggcttc aggcaacaga tgccatctct gccctttatc tcccagctct gttggctatg 130440 ttaagctcat gacaaagcca aggccacaaa tagaactgaa aactcttgat gtcagagatg 130500 acctctcttg tcttccttgt gtccagtatg gtgttttgct tgagtaatgt tttctgaact 130560 aagcacaact gaggagcagg tgcctcatcc cacaaattcc tgacttggac acttccttcc 130620 ctcgtacaga gcagggggat atcttggaga gtgtgtgagc ccctacaagt gcaagttgtc 130680 agatgtcccc aggtcactta tcaggaaagc taagagtgac tcataggatg ctcctgttgc 130740 ctcagtctgg gcttcatagg catcagcagc cccaaacagg cacctctgat cctgagccat 130800 ccttggctga gcagggagcc tcagaagact gtgggtatgc gcatgtgtgt gggggaacag 130860 gattgctgag ccttggggca tctttggaaa cataaagttt taaaagtttt atgcttcact 130920 gtatatgcat ttctgaaatg tttgtatata atgagtggtt acaaatggaa tcattttata 130980 tgttacttgg tagcccacca ctnccctaaa gggactctat aggtaaatac tacttctgca 131040 ccttatgatt gatccatttt gcaaattcaa atttctccag gtataattta cactagaaga 131100 gatagaaaaa tgagactgac caggaaatgg ataggtgact ttgcctgttt ctcacagagc 131160 ctgctgtctc ctgtggcttt tgggtttggc tgtgagtact ttgccctttt tgaggagcag 131220 ggcattggag tgcagtggga caacctgttt gagagtcctg tggaggaaga tggcttcaat 131280 ctcaccactt cggtctccat gatgctgttt gacaccttcc tctatggggt gatgacctgg 131340 tacattgagg ctgtctttcc aggtacactg ctttgggcat ctgtttggaa aatatgactt 131400 ctagctgatg tcctttcttt gtgctagaat ctctgcagtg catgggcttc cctgggaagt 131460 ggtttgggct atagatctat agtaaacaga tagtccaagg acaggcagct gatgctgaaa 131520 gtacaattgt cactacttgt acagcacttg tttcttgaaa actgtgtgcc aggcagcatn 131580 gcaaaatgtt ttatacacat tgcttcattt aattctcaca aggcntactn ctgaagtagt 131640 tactataata accagcaatt ttcaaatgag agaactgtga ctcaaagacg ttaagtaacc 131700 agctttggtc acacaactgt taaatgttgg tacgtggagg tgaatccact tcggttacac 131760 tgggtcaata agcccaggcg aatcctccca atgctcaccc aattctgtat ttctgtgtcc 131820 tcagaggggg tacaactagg agaggttctg tttcctgagt acaggttgtt aataattaaa 131880 tatactagct ctaaggcctg cctgtgattt aattagcatt caataaaaat tcatgttgaa 131940 tttttcttta gtacttcttt cttaatataa tacatcttct tgaccaagtc caagaggaac 132000 ctgcgttgga cagttttcat atgagatcaa attctgagag agcaagattt aacccttttt 132060 ggttcacctt ctgatcctcc cctaaggagg tatacatgaa atatttatta ctcctgcctg 132120 aacttctttc attgaatatg caattttgca gcatgcagat tctggattta aattctgagt 132180 cttaacttac tggctgaggg accttggata ggctccttat ccctcagttt cctcatctct 132240 aaaatgggga tggcacctgc cccgtgggtt gttggaagga cttacagagg tgcagaatgt 132300 acgttgtaca tagcaggttt cagcaaatgt tagctccctc tttccccaca tccattcaaa 132360 tctgttcctt ctccaaagga tgtgtcaagg aggaaatgga cctggctggg aaaccctcag 132420 aatactggga tgatgctgag cttggctcat acctgtgctt tgctttcagg ccagtacgga 132480 attcccaggc cctggtattt tccttgcacc aagtcctact ggtttggcga ggaaagtgat 132540 gagaagagcc accctggttc caaccagaag agaatatcag aaagtaagtg ctgttgacct 132600 cctgctcttt ctttaaccta gtgctgctgc ctctgctaac tgttgggggc aagcgatgtc 132660 tcctgccttt ctaaaagact gtgaaaccac tccaggggca gagaaatcac atgcagtgtc 132720 cctttccaaa tcctcccatg ccatttatgt ccaatgctgt tgacctattg ggagttcacg 132780 gtctcgatcc ctgagggaca ttttctttgt tgtcttggct tctagaagag tatcttttac 132840 ttgccccctc ccaaacacac atttcatggt ctcctaacaa gctagaagaa agaggtaaag 132900 acaagcgtga ttgtggaacc atagcctcgc tgcctgcctg tgacatggtg acctgtgtat 132960 cagcctgtgt gggctgagac caagtggcta ccacagagct cagcctatgc ttcataatgt 133020 aatcattacc cagatcccta atcctctctt ggctcttaac tgcagacaga gatgtccaca 133080 gctcatcaaa ggctctgcct tctgggttct ttgtgcttag agtggcttcc taaatattta 133140 ataggtccct tttctgccag tctcttctgt gcccatcccc tgattgccct tggtaaaagt 133200 atgatgcccc ttagtgtagc acgcttgcct gctgttccta atcatcttct cctacctcct 133260 ctttacacct agctcctgtt tcagtcacct agaaatgctc acagtcgctg gaatatgtca 133320 tgttcttcca cacctccatg cctttgtagg tactgtttgc tctcacagga gaactttctc 133380 tctaacttgc ctatcttctc aactcctcct ttctctccaa gatctagttc cggatcccct 133440 cccctgagca tccctccttg gttctcaggt agtcagtcac tctctgccct gaacttccat 133500 ggcacgtgaa agaaaatctt tttattttaa aacaattaca gactcacaag aagtaataca 133560 aattacatga gggggttccc ttaaaccttt catccagttt ccccaatggt agcagcatgt 133620 gtaactgtag aatagtatca aaaccatgaa attgacatag gtacaattca caaaccttct 133680 tcagatttca ctagctttat gtgcgctcat ttgtgtgtgt gtgtgcgtat ttagttctat 133740 gcaattttat catgtgtgaa ttcatgtaat tactagctca gtcaagctgc agaaatatct 133800 cattgtcaca aagctccttc atgctacccc ttaantggcc acnagccnac ctcccttctt 133860 cctcagttcc tgacacctgt caaccactaa tgcgttcctc gtttttacag ttttattatt 133920 tctagaatgt tacataaatg gaaccataca gtaggtatcc ttttgatact ggcttttttt 133980 tttttttcac tcagcagtat tcccttagat ctatccaagt tgtgtgtgtc aacagttcat 134040 tcctcttcac tgctgagtag tgttccctgg gaggggtgta tcacagttcc atggcatttt 134100 tagatgtatt ttttaaacag ctttcagcat cctctatttt aattgttcat caagtccttt 134160 ttcccaatag actctgaatg ctcctttatc atcgtattcc catcaccaac atcagtaccc 134220 aaataggccc taaataaaca tttatagcct cctgcctgcc tgagaaacca gggtggacat 134280 ggagagaagg cacttctgaa agttcaagcg cagtgcsctg tgtccttaca ctccactcct 134340 cagtgctttc tgtgggttca tttctgtctt ctctcctgtc acagtctgca tggaggagga 134400 acccacccac ttgaagctgg gcgtgtccat tcagaacctg gtaaaagtct accgagatgg 134460 gatgaaggtg gctgtcgatg gcctggcact gaatttttat gagggccaga tcacctcctt 134520 cctgggccac aatggagcgg ggaagacgac caccatgtaa gaagagggtg tggttcccgc 134580 agaatcagcc acaggagggt tctgcagtag agttagaaat ttatacctta ggaaaccatg 134640 ctgatccctg ggccaaggga aggagcacat gaggagttgc cgaatgtgaa catgttatct 134700 aatcatgagt gtctttccac gtgctagttt gctagatgtt atttcttcag cctaaaacaa 134760 gctggggcct cagatgacct ttcccatgta gttcacagaa ttctgcagtg gtcttggaac 134820 ctgcagccac gaaaagatag attacatatg ttggagggag ttggtaattc ccaggaactc 134880 tgtctctaag cagatgtgag aagcacctgt gagacgcaat caagctgggc agctggcttg 134940 attgccttcc ctgcgacctc aaggacctta cagtgggtag tatcaggagg ggtcaggggc 135000 tgtaaagcac cagcgttagc ctcagtggct tccagcacga ttcctcaacc attctaacca 135060 ttccaaaggg tatatctttg gggggtgaca ttcttttcct gttttctttt taatcttttt 135120 ttaaaacata gaattaatat attatgagct tttcagaaga tttttaaaag gcagtcagaa 135180 atcctactac ctaacacaaa aattgttttt atctttgaat aatatgttct tgtttgtcca 135240 ttttccatgc atgcgatgtt aggcatacaa aatacatttt ttaaagaata ctttcattgc 135300 aaattggaaa cttcgtttaa aaaatgctca tactaaaatt ggcatttcta acccataggc 135360 ccacttgtag ttatttaccg aagcaaaagg acagctttgc tttgtgtggg tctggtaggg 135420 ttcattagaa aggaatgggg gcggtgggag ggttggtgtt ctgttctctc tgcagactga 135480 atggagcatc tagagttaag ggtaggtcaa ccctgacttc tgtacttcta aatttttgtc 135540 ctcaggtcaa tcctgaccgg gttgttcccc ccgacctcgg gcaccgccta catcctggga 135600 aaagacattc gctctgagat gagcaccatc cggcagaacc tgggggtctg tccccagcat 135660 aacgtgctgt ttgacatgtg agtaccagca gcacgttaag aataggcctt ttctggatgt 135720 gtgtgtgtca tgccatcatg ggaggagtgg gacttaagca ttttactttg ctgtgttttt 135780 gttttttctt tttttctttt ttattttttt gagatggagt ctcgctctgt agccaggctg 135840 gactgtagtg gcgcgatctc ggctcactgc aaccttggcc tcccaggttc aagcgattct 135900 cctgcctcag cctcccgagt agctgggact ctaggcacac accaccatgc ccagctaatt 135960 tttgtgtttt tagtagagac ggggtttcac catgttggcc aggatggtct caatgtcttg 136020 acctcgtgat ccgcccacct cggtctccca aagtgctggg aacacaggca tgagccactg 136080 tgtctggcca cattttactt tctttgaata tggcaggctc acctccgtga acaccttgag 136140 acctagttgt tctttgattt taggagaagt gggaggtgaa tggttgagct gtagaggtga 136200 catcagccca gccagtggat gggggcttgg gaaacattgc ttcccattat tgtcatgctg 136260 gagggccctt tagcccatcc tctccccccg ccaccctcct tattgaggcc tggagcagac 136320 ttcccagacc tggtagtgct tcagggccct ggtatgatgg acctatattt gctgcttaag 136380 acatttgctc ccactcaggt tgtcccatca gccataaggc ccccagggag cccgtgtgat 136440 ggagcagaga gagacctgag ctctgcaatc ttgggcaagg cttttccctt atgtttcttc 136500 ttatctaaag tgaacagctg gggctcatgt gctccctcct catctaaagt gaacacatgg 136560 ggctcatgtg cagggtcctc cccgctttca gagcctgagg tcccctgagg ctcaggaagg 136620 ctgctccagg tgagtgccga gctgacttct tggtggacgt gctgtgggga cagcccatta 136680 aagaccacat cttggggccc tgaaattgaa agttgtaact gcctggtgca tggtggccag 136740 gcctgctgga aacaggttgg aagcgatctg tcacctttca ctttgatttc ctgagcagct 136800 catgtggttg ctcactgttg ttctaccttg aatcttgaag attatttttc agaaattgat 136860 aaagttattt taaaaagcac ggggagagaa aaatatgccc attctcatct gttctgggcc 136920 aggggacact gtattctggg gtatccagta gggcccagag ctgacctgcc tccctgtccc 136980 caggctgact gtcgaagaac acatctggtt ctatgcccgc ttgaaagggc tctctgagaa 137040 gcacgtgaag gcggagatgg agcagatggc cctggatgtt ggtttgccat caagcaagct 137100 gaaaagcaaa acaagccagc tgtcaggtgc ggcccagagc taccttccct atccctctcc 137160 cctcctcctc cggctacaca catgcggagg aaaatcagca ctgccccagg gtcccaggct 137220 gggtgcggtt ggtaacagaa acttgtccct ggctgtgccc ctaggtcctc tgccttcact 137280 cactgtctgg ggctggtcct ggagtttgtc ttgctctgtt tttttgtagg tggaatgcag 137340 agaaagctat ctgtggcctt ggcctttgtc gggggatcta aggttgtcat tctggatgaa 137400 cccacagctg gtgtggaccc ttactcccgc aggggaatat gggagctgct gctgaaatac 137460 cgacaaggtg cctgatgtgt atttattctg agtaaatgga ctgagagaga gcggggggct 137520 tttgagaagt gtggctgtat ctcatggcta ggcttctgtg aagccatggg atactcttct 137580 gttakcacag aagagataaa gggcattgag actgagattc ctgagaggag atgctgtgtc 137640 tttattcatc tttttgtccc caacatggtg cactaaattt atggttagtt gaaagggtgg 137700 atgcttaaat gaatggaagc ggagaggggc aggaagacga ttgggctctc tggttagaga 137760 tctgatgtgg tacagtatga ggagcacagg caggcttgga gccaactctg gcntggccct 137820 gagacattgg gaaagtcaca acttgcctca ccttctttgc cgataataat agtggtgcnt 137880 tacctcatag aggattaaat taaatgagaa tgcacacaaa ccacctagca caatgcctgg 137940 catatagcaa gttcccaaat aaaatgcnta ctgttcttac ctctgtgagg atgtggtacc 138000 tatatataca aagctttgcc attctagggr tcatagccat acagggtgaa aggtggcttc 138060 caggtctctt ccagtgctta cccctgctaa tatctctcta gtccctgtca ctgtgacaaa 138120 tcagaactga gaggcctcac ctgtcccaca tccttgtgtt tgtgcctggc aggccgcacc 138180 attattctct ctacacacca catggatgaa gcggacgtcc tgggggacag gattgccatc 138240 atctcccatg ggaagctgtg ctgtgtgggc tcctccctgt ttctgaagaa ccagctggga 138300 acaggctact acctgacctt ggtcaagaaa gatgtggaat cctccctcag ttcctgcaga 138360 aacagtagta gcactgtgtc atacctgaaa aaggtgagct gcagtcttgg tgctgggctg 138420 gtgttgggtc tgggcagcca ggacttgctg gctgtgaatg atttctccat ctccacccct 138480 tttgccatgt tgaaaccacc atctccctgc tctgttgccc ctttgaaatc atatcatact 138540 taaggcatgg aaagctaagg ggccctctgc tcccattgtg ctagttctgt tgaatcccgt 138600 tttccttttc ctatgaggca cagagagtga tggagaaggt ccttagagga cattattatg 138660 tcaaagaaaa gagacttgtc aagaggtaag agccttggct ancaaatgac ctggtngttc 138720 ctgctcatta cttttcaatc tcattgacct taacttttaa actataaaac agccaatatt 138780 tattaggcac tgatttcatg ccagagacac tctgggcant gaaagaaagt aatgataata 138840 gttaatttta tatagcgttg ttaccattta caaccttttt ttttttttta acctctatca 138900 tctcaattaa agtgcagaga gaccctggga agaaggtaac tatatttatt atcccagatg 138960 agggaagtga ggcttgtagg gaattggtag ctgattcaag gtcacccagc aggtaaataa 139020 cagtggtggg accagaccca attaccaggt atgttttcct ctgtaccgca gtacatgcct 139080 gagatttatt tgtgtgttga agccagtggt acctaatgta tttacatccc aacctgaaac 139140 tcctatccac ttatttacct tttaatgagc ctcttaactc aagtgcagtc tgaggaccag 139200 cagcatcagg atcacttggg aacttgttag aaattcagca acctgggccc agctcagacc 139260 taccgaatca gaatctgtgc attttaacaa ggttcttgag tggttgaaca cacattaaag 139320 catgagaagc attgaactag acatgtagcc aggtaaaggc cttgcctgag atggttggca 139380 aaggcctcat tgcagcattc attggcaggc cacagttctt ttggcagctc tgcttcctga 139440 cctttcaccc tcaggaagcg aggctgttca cacggcacac acatgccaga cagggtcctc 139500 tgaagccacg gctgccagtg catgtgtccc agggaaagct ttttccttta gttctcacac 139560 aacagagctt cttggaagcc ctccccggcg aaggtgctgg tggctctgcc ttgctccgtc 139620 cctgacccgt tctcacctcc ttctttgcca tcaggaggac agtgtttctc agagcagttc 139680 tgatgctggc ctgggcagcg accatgagag tgacacgctg accatcggta aggactctgg 139740 ggtttcttat tcaggtggtg cctgagcttc ccccagctgg gcagagtgga ggcagaggag 139800 gagaggtgca gaggctggtg gcgctgactc aaggtttgct gctgggctgg ggctgggtgg 139860 ctgcgggkgt gggagcagct tggtggcggg ttggcctaat gcttgctggg gtgcctgggg 139920 ctcggtttgg gagctagcag ggcagtgtcc cagagagctg agatgattgg ggtttgggga 139980 atcccttagg ggagtggaca ctgaatacca gggatgagga gctgagggcc aagccaggag 140040 ggtgggattt gagcttagta cataagaaga gtgagagccc aggagatgag gaacagcctt 140100 ccagattttt cttgggtagc gtgtgtagga ggccagtgtc accagtagca tatgtggaac 140160 agaagtcttg acccttgcta tctctgccta gtcctaatgg ctggcttttc ccaggaaggc 140220 ttctgcttcc atggactgtt agattaaccc tttatttagg taaatgaggg aacctacttt 140280 ataagcatag gaaagggtga agaatctttt aagattcctt tactcaagtt ttcttttgaa 140340 gaatcccaga gcttaggcaa tagacaccag actttgagcc tcagttatcc attcacccat 140400 ccacccaccc acccacccat ccttccatcc tcccatcctc ccattcaccc atccacccat 140460 ccagctgtcc acccattcta cactgagtac ctataatgtg cctggctttg gtgatacaaa 140520 ggtgaataag acatagtcct ttcctttgcc cccaaccctc agaccagaga tgaacatgtg 140580 gaatgaccta aacacctgga acaggtgtgg tgtatgagcg gcaggcctct gatgagaggg 140640 tgggggatgg ccagccctca ctccgaagcc cctctgagtt gattgagcca tctttgcatt 140700 ctggtccctg cagatgtctc tgctatctcc aacctcatca ggaagcatgt gtctgaagcc 140760 cggctggtgg aagacatagg gcatgagctg acctatgtgc tgccatatga agctgctaag 140820 gagggagcct ttgtggaact ctttcatgag attgatgacc ggctctcaga cctgggcatt 140880 tctagttatg gcatctcaga gacgaccctg gaagaagtaa gttaagtggc tgactgtcgg 140940 aatatatagc aaggccaaat gtcctaaggc cagaccagta gcctgcattg ggagcaggat 141000 tatcatggag ttagtcattg agtttttagg tcatcgacat ctgattaatg ttggccccag 141060 tgagccattt aagatggtag tgggagatag caggaaagaa gtgttttcct ctgtaccaca 141120 gtacatgcct gagatttgtg tgttgaaacc agtggtacct aacacattta catcccaacc 141180 ttaaactcct atgcacttat ttacccttta atgagcctct ttacttaagt acagtgkgag 141240 gaacagcggc atcaggatca cttgggaact tgttagaaat tcagcaactt gggcccagct 141300 cagacctact gaatcagaat caggagcaat tctctggtgt gactgtgtca cagccaggta 141360 tcaactggat tctcatacat aggaaatgac aaacgtttat ggatggatag tctacttgtg 141420 ccaggtgctg agatttgttt tttgtttttt gatttttttt taatcactgt gacctcattt 141480 aattctcaaa aaaagatgaa aaaatgaaca ctcaggaatg ctgacatgag attcagaatc 141540 aggggtttgg ggcttcaaag tccatcctct ctttatccat gtaatgcctc cccttagaga 141600 tacaacatca cagaccttga aggctgaagg ggatataaaa gctgtctggc caagtggtct 141660 ccaagcttga cagtgcagca gaatcacctg gggatattat taaaaataaa catactaagg 141720 tttggcttca gggcctgtga atcagaattt ctggaggtga ggccttgaag tctgtatttc 141780 tattgcatac tttggacaca gtggtctata gactagagtt tggaaatgat tgcgctcatt 141840 cagattctct tctgatgttt gaattgctgc catcatattt ctagtgctct atttcctcct 141900 gctcattctg tcttggataa cttatcatag tactagccta ctcaaagatt tagagccaca 141960 gtcctgaaag aagccacttg actcattccc tgtaggttca gaataaattt cttctgcgca 142020 gtgtctgtca tagctttttt taaatttttt tttatttttg atgagactgg agttttgctc 142080 ttattgccca agctggagtg cagtggtgcg attttggctc actgcaacct ccacctccca 142140 ggttcaagcg attctcctgc ctcagcctcc caagtagctg agattacaag catgtgctac 142200 cacgcccagc taattttgta tttttagtag agatgggttt tatccatgtt ggtcaggctg 142260 gtctcgagct ccagacctca ggtgatctgc ccgcctcggc ctcccaaagt gctgggatta 142320 taggcctgag ccacagcgct cagccataac tttaatttga aaatgattgt ctagcttgat 142380 agctctcacc actgaggaaa tgttctctgg caaaaacggc ttctctccca ggtaactctg 142440 agaaagtgtt attaagaaat gtggcttcta ctttctctgt cttacggggc taacatgcca 142500 ctcagtaata taataatcgt ggcagtggtg actactctcg taatgttggt gcttataatg 142560 ttctcatctc tctcattttc cagatattcc tcaaggtggc cgaagagagt ggggtggatg 142620 ctgagacctc aggtaactgc cttgagggag aatggcacac ttaagatagt gccttctgct 142680 ggctttctca gtgcacgagt attgttcctt tccctttgaa ttgttctatt gcattctcat 142740 ttgtagagtg taggtttgtt gcagatgggg aaggtttgtt ttgttgtaaa taaaataaag 142800 tatgggattc tttccttgtg ccttcagatg gtaccttgcc agcaagacga aacaggcggg 142860 ccttcgggga caagcagagc tgtcttcgcc cgttcactga agatgatgct gctgatccaa 142920 atgattctga catagaccca ggtctgttag ggcaagatca aacagtgtcc tactgtttga 142980 atgtgaaatt ctctctcatg ctctcacctg ttttctttgg atggccttta gccaaggtga 143040 tagatcccta cagagtccaa agagaagtga ggaaatggta aaagccactt gttctttgca 143100 gcatcgtgca tgtgatcaaa cctgaaagag cctatccata tcacttcctt taaagacata 143160 aagatggtgc ctcaatcctc tgaacccatg tatttattat cttttctgcg gggtcctagt 143220 ttcttgtata cattaggtgt ttaattgttg aacaaatatt cattcgagta gatgagtgat 143280 tttgaaagag tcagaaaggg gaatttgctg ttagagttaa ttgtacccta agacttagat 143340 atttgaggct gggcatggtg gctcatgcca gtaatcccag cgctttgaga ggctgaggtg 143400 ggtagatcac ctgaggtcag gagtttgaga ccagtctgac caacaaggtg aaaccccgtc 143460 tctactaaat acaaaaaatt agccgagtgt ggtggcacat gcctgtcatc ccagctactt 143520 gggaggctga ggcaggagaa tcgcttgaac ccaggaggca gaggttgcag tcagccacgg 143580 ttgcgccatt gcactccaga ctgggcaaca agagtgaaaa ctccatctca aaaaagaaaa 143640 aaaaagaatt agatattttg gatgagtgtg tctttgtgtg tttaactgag atggagagga 143700 gagctaagac atcaaacaaa tattgttaag atgtaaaagc acatcagtta ggtatcatta 143760 gtttaggaca aggatttcta gaaaattttt aggaacagaa aactttccag ttctctcacc 143820 cctgctcaaa gagtgtatgg ctcttacatt atatataact gcctgacttc atacagtatc 143880 agtacttaga tcatttgaaa tgtgtccacg ttttaccaaa atataatagg gtgagaagct 143940 gagatgctaa ttgccattgt gtattctcaa atatgtcaag ctacgtacat ggcctgtttc 144000 atagagtagt ctataagaaa ttgatgactt gattcatccg aatggctggc tgtaacacct 144060 ggttacgcat gaacacctct tttcagttgt ctcaagacac ctttcttttc tgtacttatc 144120 agacaaggac tgaaaggcag agactgctac tgttagacat tttgagtcaa gcttttcctt 144180 ggacatagct ttgtcatgaa agccctttac ttctgagaaa cttctagctt cagacacatg 144240 ccttcaagat agttgttgaa gacaccagaa gaaggagcat ggcaatgccg aaaacaccta 144300 agataatagg tgaccttcag tgttggcttc ttgcagaatc cagagagaca gacttgctca 144360 gtgggatgga tggcaaaggg tcctaccagg tgaaaggctg gaaacttaca cagcaacagt 144420 ttgtggccct tttgtggaag agactgctaa ttgccagacg gagtcggaaa ggattttttg 144480 ctcaggtgag acgtgctgtt ttcgccagag actctggctt catgggtggg ctgcaggctc 144540 tgtgaccagt gaaggcagga tagcatcctg gtcaagatat ggatgccgga gccagattta 144600 tctgtatttc aatcccagtt ctattccttg ccagttgtgt atccgctggc aagttacttc 144660 tctatgcctc aatctcctca tctgtaaaat ggggataata atattacctg caatacaggg 144720 ttgttacgaa aataaaaatg aataggtgct tagaatgggg cctgacatta gtaagtgctt 144780 agttttgtgt gtgtatatgt tatttttatt ttggaggaga acataaaaag gacaaagtgt 144840 agaaaaactg gttgggtgta ttcagctgtc ataacatgag agttgttatg cccagatgca 144900 cttgacatgt gaatttatta gaaacatgat ttttctctga gttgatgttt aactcaaact 144960 gatagaaaag ataggtcaga atatagttgg ccaacagaga agacttgtta gactattgtc 145020 tgcatgtcag tgtttgcatg ctaacttgct tagttagaaa ggttaaattt tttcactcta 145080 taaaatcaag aaatatagag aaaaggtctg cagagagtct ttcatttgat gatgtggata 145140 ttgttaagag cgggagtttg gagcatacag agctcaagtt gaatcctgac tttgctactt 145200 attggctata tgaccttggg caagctgctt agtctctctg atcctcagtt acctttgttt 145260 gttgatgatg accattgata acacaaccat aaataatgac aacatagaga tagttctcat 145320 tatagtagtt gttatacaga attattcact caatgttaat tttctgcatt gaaatcccag 145380 aacattagaa ttgggggcat tatttgaatc tttaaggtta taaggaatac atttctcagc 145440 aataaatgga aggagttttg ggttaactta taaagtatac ccaagtcatt tttttttcag 145500 agaagatatg gtagaaagtc ttaggaggtt gaagaaggaa ttggatattt attctttctg 145560 agactatcat gggagataat gactatggtt gtccatgatt ggagccgttg ctgtagagtt 145620 ggttttatta tagtgtagga tttgaatggg ccatgtgttc tcagacctca gattaaaawg 145680 agaaaactga ggccagtggg gagcgtgact tcacatgggt acacttgtgc tagagacaga 145740 accaggattc aggacttctg gctcctggtc ctgggttcat ggcccaatgt agtctttctc 145800 agtcttcagg aggaggaagg gcaggaccca gtgttctgag tcaccctgaa tgtgagcact 145860 atttacttcg tgaacttctt ggcttagtgc ctctgccagg tggccataac ctctggcctt 145920 gtgttgccag agaaaaggtt tagttttcag gctccattgc ttcccagctg ccaagaatgc 145980 cttggtgcag cacagtcata ggccctgcat tcctcattgc cgtgctggtt ggtcggggag 146040 gtgggctgga ctcgtaggga tttgcccctt ggccttgttt ctaacacttg ccgtttcctg 146100 ctgtccccct gccccctcca ctgcctgggt aaagattgtc ttgccagctg tgtttgtctg 146160 cattgccctt gtgttcagcc tgatcgtgcc accctttggc aagtacccca gcctggaact 146220 tcagccctgg atgtacaacg aacagtacac atttgtcagg tatgtttgtc ttctacatcc 146280 caggaggggg taagattcga gcagaccaaa gatgtttacg agggccaagg gaatggactt 146340 cagaattaca cggtggaatg aattttactg ctgcggctca ggtccctgta taagctaata 146400 ctgcatgcat agaacagcag cgaactaacc ctgaataata ggccagtctt ctgttgagcc 146460 tttcagcctc tctcctcttc atcctactgt tgtcaggaac agccacatgt gttttaggtg 146520 aaataatcca cccttgcaaa aatccatgat taagttataa aatatttgga tttgtggagc 146580 tgtgttttaa ttctgtaact gagtcacagg gcacactgtc aaagcataga acctccagag 146640 acttgttttc tgcaaagtat aattcatgta attattatct attctgttat atttgggatg 146700 ttaggtagtg tttgttcttt agataaaaat atcccccact ctgtaacaat acattaaatc 146760 aaagaaaagg acaaaggatt tttctgggtc ttgttagcag gagctttctt cagtcctgaa 146820 agatttgtag acctgtagat gggggaactg tgtcagtgat acaaaaggga agcatttaaa 146880 aaaaaaaaag tatatatata tatatatata tatatatgta atgtgaattg gcctcttttt 146940 ctctaagncc cacattttnc ttcttacata gttcaggttt actttatttt ttcctttccg 147000 gctgctgacc ctgtattgcc cgtagttgtg gaacatagca tgtgtttgtg acctgtgcct 147060 gttatttttg tgctttctag ttgtgcatgc aaagagtaca aagttttctt gccctttctt 147120 ggaaaatcct gcttgtctgt gccaaaggga taattgtgaa agcacttttg aaatacttaa 147180 tgagttgatt ttcttcaaat taaaaaaaat atataaatgt atatgtgtat gtacatgtgt 147240 gtacacatac acacctttat acatacagcc catttaaaac aagctccact ttggagtgct 147300 ctacgtcacc ctgatgccga atacagggcc agagtctgag atccttctgg gtggtttctg 147360 tgttttgttc atttctgttt taagagcctg tcacagagaa atgcttccta aaatgtttaa 147420 tttataaaaa catttttatc tctcgattac tggttttaat gaattactaa gctggctgcc 147480 tctcatgtac ccacagcaat gatgctcctg aggacacggg aaccctggaa ctcttaaacg 147540 ccctcaccaa agaccctggc ttcgggaccc gctgtatgga aggaaaccca atcccgtgag 147600 tgccacttta gccataagca gggcttcttg tgcttgttgc ctggtttgat ttctaatatg 147660 ctgcatttat caactgcatg ccacattgtg accgccagca tttgcccttt gaattattat 147720 tatgttttat ttacaaaaag cgaaggtagt aaccgaacta aattatctag gaacaaacgt 147780 ttggagagtc ttctaacacc gtgcaaagca cgtcattaca gacatttgtt tactgattta 147840 gaaccttaat atttaattta aatangcact ttacacttac tgatgaaatg cttttccttt 147900 ctttctctcc cagcccctgt acttaagtgc ttcaataggc tctcattata tatgattttt 147960 aggttttgct tatcagcttc ttcgctttta taatctgaaa agatggcata tgaattttta 148020 taaaaaggga cactttcttc ttctcaaatt gtatattttt attgtacttt ccttcaaaac 148080 ccccttttaa aaagtaagca gtggataaat aaattcagtg aagcatccat atgaccctta 148140 agtgagtgta ggggaaggga ggtcaccaga tcactgtgag tgaagatggt ggagaggtga 148200 ggatcttatg aggccgtgct caaggctggt agaggtgggt tagtgtttcc aggtttaggc 148260 agaatctcag ctgaggtcat gaaacaacag tgatctctga aaaattatgg caaggtggga 148320 aggtgctgga gaattggaga gggggcaaac ttgactttca agtttcaatg ggaagatagg 148380 tgactctgca caccacagaa cagtgagcat gataacctgt ttatacaagg ttctagagca 148440 gatttctaaa tggatagcta ctgtgtgctt gtttgttctt aattagtatt ggatagttac 148500 taaatacttg ttagtactta gtacataatg ggtggtaaat cctagcagct aatattggtt 148560 cccaaataac cagatgacaa ggatagagaa ggacacagac acggcctatc tggatttcat 148620 ggtgcctttg attttccaca tgaaggttgt gtagggaaga tagaagcatg agatgagatg 148680 ataatatagt tatctggatt catcactggc cagctgaacc atatgaactc atggattgat 148740 gctagcttag gaaggctctg taggagccag aactgggctg agagccagcc catagagaca 148800 aaagaggccc ggccctgaca tcagagggtt caaacatgat gtctgagccc cacctacagt 148860 ctgccggagg tggttggaag gaagagcctt tatccttaca attcttactg aaattcaaat 148920 ttttaggttt tgcaaaaaaa tggtggacct gaaggaaatt tgacaggagc atgtctcagc 148980 tgtatttaaa tttgtctcag ccaatcccct tttgaatgtt cagagtgtaa gcttcaggag 149040 ggcagcgcgt cttagtgtga cttttctggt cagttcaggt gctttaagga gacaattaga 149100 gatcaatctg gaaaacttca tttgaatttt taatacataa gaaaacaata agaaatagtt 149160 aaaaatatat atttatataa tatatatgtg tgtgtgtgtg tgtgtgtgtg tatatatata 149220 tatatnttta tttnatttat ttnnnnnnnn ttttttgaga tggagtctcg ctctgttgcc 149280 caggctggag tgcagtggct caatcttggc tcactgccac ctctgcctcc caggttcaag 149340 tgattctcct acctcagcct cctgagtagc tgggattaca agcatgtgcc accacactgg 149400 ctaatttttc taattttagt agagatggag tttcaccatg ttggacagga tggtcttgaa 149460 ctcctgactt agtgatccac ccgccttcgc ctcccaaagt tctgggatta caggcatgag 149520 ccatcgtgcc tggcaattat atttaatatt taataataag gaaataattg ctgtaacttt 149580 actttaaatt gtggaattct gaaactggaa gggaactgga aatgacttgt tgaatcaaat 149640 cattttaaac ttttattttg ccagtggaaa aaataagccc ccaaaagagc aggggacctg 149700 ctgnatgtcc cacagtaatt cagagctgga gatgaggttg aaggctttgt gtcttatctc 149760 cagggaaaat ttgtagacag cgtagctctn ttatgtgacg agcattctca ccccagtcat 149820 cccccaattc tctactcatt tgagaacata aattggatct tgccagtctc tactcatttt 149880 tcagcacatc gagcataaga tccagactct ttcccaggcc tctctcatct ggctcctctc 149940 ctcctccttt atcattactc ttcttcgtag cttatcctac tccagccatg ctgtcttcct 150000 attattccta aaaartagaa atgcatttct tcctagggcc tttgtacctg cacttgccat 150060 cgcttttgct cagaatgttc tttttgccaa gcttttgccc agcttgttct ccatcattgt 150120 tatgttttgg ctgaaatgtc ttctcttagt aggttcattc tccccagtca ctgtcttttt 150180 attttgcttt attttgggcc atctaaggtt atcttattag tgtatttgtt gttcgtctcc 150240 tccatgggca tacacctcca tgaaggcagg tattttcacc ttaggccctc gaatatactg 150300 gacagcatct ggcacgtagt agatgctcaa cgaatgtttg ttgtgtgagc aaatggttgg 150360 ttgattggat tgaactgagt tcagtatgta aatatttagg gcctctttgc attctatttt 150420 acttatgtat aaaatgatac ataatgatga tataaatgat gtcacagtgt acaaggctgt 150480 tgtgggatca agcaatcaaa tgagatcatg cttgtctttt ccaaatggtg agggaataga 150540 tgcatgtttg tggttgttac ggaatgatcc tgtgctcctg aggcaacaga aaggccaggc 150600 catctctggt aatcctactc ttgctgtctt ccctttgcag agacacgccc tgccaggcag 150660 gggaggaaga gtggaccact gccccagttc cccagaccat catggacctc ttccagaatg 150720 ggaactggac aatgcagaac ccttcacctg catgccagtg tagcagcgac aaaatcaaga 150780 agatgctgcc tgtgtgtccc ccaggggcag gggggctgcc tcctccacaa gtgagtcact 150840 ttcagggggt gattgggcag aaggggtgca ggatgggctg gtagcttccg cttggaagca 150900 ggaatgagtg agatatcatg ttgggagggt ctgtttcagt cttttttgtt ttttgttttt 150960 ttttctgagg cggagtcttg ctctggtcgc ccaggctgga gtgctgtggc atgatcttgc 151020 ctcactgcaa cctccacctc ccaggttcaa gcgattctcc tgcctcagcc tcctgagtag 151080 ctgggattac aggcacgcac caccatgtct ggctaatttt tgtgttttta gtagagatag 151140 ggtttcgccg tgttggctag gctggtctgg aattcctgac ctcaggtgat ccacccgcct 151200 cggcctccca aagtgctggg attacaggcg tgagccacta cgcccagccc tgtttcagtc 151260 tttaactcgc ttcttgtcat aagaaaaagc atgtgagttt tgaggggaga aggtttggac 151320 cacactgtgc ccatgcctgt cccacagcag taaagtcaca ggacagactg tggcaggcct 151380 ggcttccaat cttggctctg caacaaatga gctggtagcc tttgacaggc ctgggcctgt 151440 ttcttcacct ctgaattagg gaggctggac cagaaaactc ctgtggatct tgtcaactct 151500 ggtattctta gagactctgt ttgggaagga gtcctgagcc attttttttt tcttgagaat 151560 ttcaggaaga ggagtgctta tgatagctct ctgctgcttt tatcagcaac caaattgcag 151620 gatgaggaca agcaattcta aatgagtaca ggaactaaaa gaaggcttgg ttaccactct 151680 tgaaaataat agctagtcca ggtgcggggt ggctcacacc tgtaatctca gtattttggg 151740 atgccgaggt ggactgatca cctaaggtca ggagttcgaa accagcttgg ccaatgtggc 151800 gaaaccctgt ctctactaaa aattcaaaaa ttagccaggc atggtggcac atgcctgtaa 151860 tcccagttac ttgggaggct gaagcaggag aattgcttga acctgggagg tggaggtcgc 151920 agggagccaa aattgcgcca ctgtactcca gcctgagcaa cacagcaaaa ctccatatca 151980 aaaaataaaa tgaataaaat aacagctaat ctagtcatca gtataactcc agtgaacaga 152040 agatttatta ggcatagtga atgatggtgc ttcctaaaaa tctcttgact acaaagaatc 152100 tcatttcaat gtttattgtt tagatgttca gaataaattc ttgggaaaga ccttggcttg 152160 gtgtaagtga attaccagtg ccgagggcag ggtgaaccaa gtctcagtgc tggttgactg 152220 agggcagtgt ctgggacctg tagtcaggtt tccggtcaca ctgtggacat ggtcactgtt 152280 gtccttgatt tgttttctgt ttcaattctt gtctataaag acccgtatgc ttggttttca 152340 tgtgatgaca gagaaaacaa aacactgcag atatccttca ggacctgaca ggaagaaaca 152400 tttcggatta tctggtgaag acgtatgtgc agatcatagc caaaaggtga ctttttacta 152460 aacttggccc ctgccktatt attactaatt agaggaatta aagacctaca aataacagac 152520 tgaaacagtg ggggaaatgc cagattatgg cctgattctg tctattggaa gtttaggata 152580 ttatcccaaa ctagaaaaga tgacgagagg gactgtgaac attcagttgt cagcttcaag 152640 gctgaggcag cctggtctag aatgaaaata gaaatggatt caacgtcaaa ttttgccact 152700 tagtagcaac ttgaccaggt aactggttat ccttttaaag ccttagttta tctaaattgt 152760 gatattaatg ttgctcttat aagtttgtca tgaggactaa attaaatggt gtacatagag 152820 tgccttgggt actctctgat gggggactcc atgataattt gtggtctcat ggagggagct 152880 ctgggaaggt ttaggagcct gccttggctc tgcagccttg ggagagcctt ctagcttccc 152940 aggacatggc agcctagtgt tgaatgcttg gctcagcaaa tgtttgttct cgtttccttc 153000 ccatcaactt ggtcagttgg ggtctttcag ttaggagtat ctcagtgact ttaaatggca 153060 tgggcatgct ggagtgatag tgaccatgag tttctaagaa agaagcataa tttctccata 153120 tgtcatccac aattgaaata ttattgttaa ttgaaaaagc ttctaggcca ggcacggtgg 153180 ctcatgcctg taatcccagc actttaggag gccaaggcgg gtggatcact tgaggtcagg 153240 agtttgagac cagcctggcc aacatgggga aaccctgtct ctactaaaaa tacaaaataa 153300 gctgggcgtg gtggtgcgtg cctgtaatcc cagctacttg ggaggctgag gcaggagaat 153360 tgcttgaatc tgggaggcgg aggttgcagt gagctgagtt catgccattg cattccagcc 153420 tgggcaacaa gagcgaaacc atctcccaaa agaaaaaaaa aagaaagaaa aagcttctag 153480 tttggttaca tcttggtcta taaggtggtt tgtaaattgg tttaacccaa ggcctggttc 153540 tcatataagt aatagggtat ttatgatgga gagaaggctg gaagaggcct gaacacaggc 153600 ttcttttctc tagcacaacc ctacaaggcc agctgattct agggttattt ctgtccgttc 153660 cttatatcct caggtggata tttactcctt ttgcatcatt aggaataggc tcagtgcttt 153720 ctttgaactg attttttgtt tctttgtctc tgcagcttaa agaacaagat ctgggtgaat 153780 gagtttaggt aagttgctgt ctttctggca cgtttagctc agggggagga tggtgttgta 153840 ggtgtncttg gattgaagaa agccttgggg attgtttgtc actcacacac ttgtgggtgc 153900 catctcactg tgaggaggac agaagccctg tgaacatgtg gagcacacag gggcacagac 153960 agatttagat taggcctgct ttatagagtt tctgcctaga gcatcatggc tcagtgccca 154020 gcagcccctc cagaggcctc tgaaatattt gatatactga tttccttgag gagaatcaga 154080 aatctcctgc aggtgtctag ggatttcaag taagtagtgt tgtgagggga atacctactt 154140 gtactttccc cccaaaccag attcccgagg cttcttaagg actcaaggac aatttctagg 154200 catttagcac gggactaaaa aggtcttaga ggaaataaga agcgccaaaa ccatctcttt 154260 gcactgtatt tcaacccatt tgtccttctg ggttttgaag gaacaggtgg gactggggac 154320 agaagagttc ttgaagccag tttgtccatc atggaaaatg agataggtga tgtggctacg 154380 tcagggggcc cgaaggctcc ttgttactga tttccgtctt ttctctctgc cttttcccca 154440 agggccagga cccctggatc tctgggcaga gcagacgcag gcccctataa tagccctcat 154500 gctagaargg agccggagcc tgtgtataag gccagcgcag cctactctgg acagtgcagg 154560 gttcccactc tcccaactcc ccatctgctt gcctccagac ccacattcac acmcgagcca 154620 ctgggttgga ggagcatctg tgagatgaaa caccattctt tcctcaatgt ctcagctatc 154680 taactgtgtg tgtaatcagg ccaggtcctc cctgctgggc agaaaccatg ggagttaaga 154740 gattgccaac atttattaga ggaagctgac gtgtaacttc tnngaggcaa aatttagccc 154800 tcctttgaac aggaatttga ctcagtgaac cttgtacaca ctcgcactga gtctgctgct 154860 gatgatactg tgcaccccac tgtctgggtt ttaatgtcag gctgttcttt taggtatggc 154920 ggcttttccc tgggtgtcag taatactcaa gcacttcctc cgagtcaaga agttaatgat 154980 gccatcaaac aaatgaagaa acacctaaag ctggccaagg taaaatatct atcgtaagat 155040 gtatcagaaa aatgggcatg tagctgctgg gatataggag tagttggcag gttaaacgga 155100 tcacctggca gctcattgtt ctgaatatgt tggcatacag agccgtcttt ggcatttagc 155160 gatttgagcc agacaaaact gaattactta gttgtacgtt taaaagtgta ggtcaaaaac 155220 aaatccagag gccaggagct gtggctcatg cctgtaatcc tagcactttg ggaggctgaa 155280 gcgggtggat cacttgaggt caggagttcg agaccagcct ggcctacatg acaaaacccc 155340 gtatctacta aaaatacaaa aaaattagct gggcttggtg gcacacacct gtaatcccag 155400 ctacttggga ggctgaggca ggagaattgc ttgaaccctg taggaagagg ttgtagtgag 155460 ccaagatcgc accgttgcac tccagcctgg gcaacaagag caaaactcca tctcaaaaaa 155520 caaattaaat ccagagattt aaaagctctc agaggctggg cgcggtggct tacacctgtt 155580 atcccagcat tttgggatgc cgaggcgggc aaagcacaag gtcaggagtt tgagaccagc 155640 ctggccaaca tagtgaaacc ctgtctctgc taaaaacata gaaaaattag ccgggcatgg 155700 tggcgtgcgc ctgtaatccc agctactcgg gaggctgagg tgagagaatt rcttgaaccc 155760 gggaggcgga ggttgcagtg agcccagatt gcaccactgc actccagcct gggcgacaga 155820 gcaagactcc atctcaaaaa aagctctcag aacaaccagg tttacaaatt tggtcagttg 155880 gtaaataaac tgggtttcaa acatactttg ctgaaayaat cactgactaa ataggaaatg 155940 aatctttttt tttttttttt taagctggca agctggtctg taggacctga taagtactca 156000 cttcatttct ctgtgtctca ggtttcccat ttttaggtga gaattaaggg gctctgataa 156060 aacagaccct aggattgtgg acagcagtgr tagtcctaga gtccacaagt ctgcttttga 156120 gtgatgggcc catgtatctg gcacatctgc aggcagagcg tggttctggc tcttcagatg 156180 atgccggtgg agcactttga ggagtcctca ccccaccgtg ataaccagac attaaaatct 156240 tggggctttg catcccagga tttctctgtg attccttcta gacttgtggc atcatggcag 156300 catcactgct gtagatttct agtcacttgg ttctcaggag ccgtttattt aatggcttca 156360 catttaattt cagtgaacaa ggtagtggca ttgctcttca cagggccgtc ctgttgtcca 156420 caggttccag attgactgtt gccccttatc tatgtgaaca gtcacaactg aggcaggttt 156480 ctgttgttta caggacagtt ctgcagatcg atttctcaac agcttgggaa gatttatgac 156540 aggactggac accaraaata atgtcaaggt aaaccgctgt ctttgttcta gtagcttttt 156600 gatgaacaat aatccttatg tttcctggag tactttcaac tcatggtaaa gttggcaggg 156660 gcattcacaa cagaaaagag caaactatta actttaccag tgaggcagta cggtgtagtg 156720 tagtgattca gagaatttgc tttgccacca gacataccag gtaaccttga ctaagttact 156780 taacctatct aaacctcagt tycctcatct gtgaaatgga gacagtaatc atagctattt 156840 ccaaactgtt gtgagaattc aatgagttaa aggtataagg tcctcaccac agcgcctgcc 156900 cacatagtca gtgatcacta tgtcctgaac actgtaatta cttcgccata ttctctgatc 156960 atagtgtttt gccttggtat gtgactagaa tttctttctg aggtttatgg gcatggttgg 157020 tgggtatgca cctgcctgca ggagcccggt ttgggggcat taccttgtac ctggtatgtt 157080 ttctttcagg tgtggttcaa taacaagggc tggcatgcaa tcagctcttt cctgaatgtc 157140 atcaacaatg ccattctccg ggccaacctg caaaagggag agaaccctag ccattatgga 157200 attactgctt tcaatcatcc cctgaatctc accaagcagc agctctcaga ggtggctctg 157260 taagtgtggc tgtgtctgta tagatggagt ggggcaaggg agagggttat ggagaagggg 157320 agaaaaatgt gaatctcatt gtaggggaac agctgcagag accgttatat tatgataaat 157380 ctggattgat ccaggctctg ggcagaagtg ataagtttac gaattggctg gttgggcttc 157440 ttgaactgca gaagagaaaa tgacactgat atgtaaaaat cgtaacattt agtgaattca 157500 tataaagtga gttcaaaaat tgttaattaa attataattt aattataagt gtttaatcag 157560 tttgatttgt ttaaaaacca ctgttttaaa tttggtggaa tatgttttta ttagcttgta 157620 tctttaattc ctaaattaag ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 157680 gtgtgtgnnt gtgaagttta aagccaggat gagctagttt aaagtatgca gcctttggag 157740 tcatacagat ctgggtttga atctggtctc taaactttat agatgtatga tattaaatga 157800 ggcagttcat gtaaattgcc aagcccagca ctcagcacag agttgatatt tcacacacat 157860 tagatacctt tcctgtatgt ggagcatggc agttcctgtt tctgctttac tcctacagga 157920 tactaatata ggacactagg atctttatac caagacccca tgtaatgggc ttatgagacc 157980 attcttctta taaaaatctg acagaatttt tgtatgtgtt agatcaatag gctgcatact 158040 gttattttca agttgattta cagccagaaa tattaattta tttgagtagt tacagagtaa 158100 tatttctgct ctcatttagt tttcaagccc cactagtcct ttgtgtgtga aaatttacaa 158160 cttactgctc ttacaaggtc atgaacagtg gaccaaagtg aatgccatta accactctga 158220 cttccttcat tagttttatt gtgacagtgg actcttttga cctcagtaat accagtttgg 158280 catttacatt gtcatatttt tagacttaaa aatgatcatc ttaaccctga ataaaatgtg 158340 tctggtgaac agatgttttt ccttgggctg tgcctcagat atctctgtgt gtgtgtacgt 158400 gtgtgtttgt ctgtgtgtcc atgtcctcac tgattgagcc ctaactgcat caaagacccc 158460 tcagattttc acacgctttt tctctccagg atgaccacat cagtggatgt ccttgtgtcc 158520 atctgtgtca tctttgcaat gtccttcgtc ccagccagct ttgtcgtatt cctgatccag 158580 gagcgggtca gcaaagcaaa acacctgcag ttcatcagtg gagtgaagcc tgtcatctac 158640 tggctctcta attttgtctg ggatatggta aggacacagg cctgctgtat ctttctgatg 158700 tctgtcaggg ccatggattg atatggataa gaaagaaaga gctctggcta tcatcaggaa 158760 atgttccagc tactctaaag atgtatgaaa aagaaatagc cagaggcagg tgatcacttt 158820 catgacacca aacacagcat tgggtaccag agttcatgtc acaccagagg gaaaattctg 158880 tacacaatga tgaaaattaa taccactacc acttaagttc ctatgtgaca actttcccaa 158940 gaatcagaga gatacaagtc aaaactccaa gtcaatgcct ctaacttctc tgatgggttt 159000 taacctccag agtcagaatg ttctttgcct tactaggaaa gccatctgtc atttngaaaa 159060 ctctgtacat tttatcagca gcttatccat ccattgcaaa tantgttttt gtgccagcca 159120 caatatattg cttctatttg gaccaatatg ggggatttga aggaattctg aagttctaat 159180 tatatttcaa ctctacttta caatatctcc ctgaaatata tctccctgta acttctatta 159240 attataagct acacagagca aatctaattc ttctcccacc gaacaagtcc ctggatattt 159300 aaaaataact ctcatactct catttaacct gagtattacc cagataagat gatatatgag 159360 aatacacctt gtaacctccg aagcactgta caaatgtgag caatgatggt ggagatgatg 159420 atgagatctt tgctgtttat accaagcccc ttagactgtg tcactcttct gatccggttg 159480 tccttgtatg gccatgctgt atattgtgaa tgtcccgttt tcaaaagcaa agccaagaat 159540 taaccttgtg ttcaggctgt ggtctgaatg gttatgggtc cagagggagt tgatctttag 159600 ctcacacttc tattactgca gcacaaagat tttgcatttt ggaaggagca ccgtcttact 159660 ggcaacttag tggtaaacca aaacctccat ttcacacaaa tgattgtgaa attcgggtct 159720 ccttcattct atacaaattc atttgatttt tttgaaacta aactttatat ttatccatat 159780 taaattacat gggttttatt tttgttttat cttgattcag taattactcc tttcagtaaa 159840 cacagactga gtgctgtgtg tctgacttat gccaggcata ggtgattcag agatgaaagg 159900 tcaagtccct gaacccatct cttgtcttcc tgggtattat ctgtccctcc ctgctttaga 159960 gctcctgaaa tttgctagaa gcatgtcttc atctaagttg ttgataaaca catcaagtag 160020 gattggactg aggcagagcc ctgtagtctg aagctgcagt tcttctagcg gctgacaagc 160080 cccactatca cttccctgct ggtgctttgc tctgccagct gtgaattctc ataattgtcc 160140 tatcgtcaag tctttatttc tgcattttac tgcttgatac actgtcagga cagactttaa 160200 aattattctc agtgcgatga aacaattctg acattcatgt tatgagcagt tacctcataa 160260 atagattaca tgtgagattg aacttgggca gactataata tagcattaat gatgaaacag 160320 acacagtcat cttcgggaag aagaatagag gcttatttgc tgcctgtgaa attaaaatta 160380 ctctgactgg gaatccatcg ttcagtaagt ttactgagtg tgacaccttg gcttgactgt 160440 tggaaagaca gaaagggcat gtagtttata aaatcagcca aggggaaaat gcttgtcaaa 160500 atgtattgtc gggtattttg attaatagtt tatgtggctt cattaattca gagttactct 160560 ccaatatgtt tatctgccct ttcttgtctg ataatggtga aaacttgtgt gatgcattgt 160620 atatttgatt taggggtgaa ctggatgtct ttgttttcac ttttagtgca attacgttgt 160680 ccctgccaca ctggtcatta tcatcttcat ctgcttccag cagaagtcct atgtgtcctc 160740 caccaatctg cctgtgctag cccttctact tttgctgtat gggtaagtca cctctgagtg 160800 agggagctgc acagtggata aggcatttgg tgcccagtgt cagaaggagg gcagggactc 160860 tcagtagaca cttatctttt tgtgtctcaa caggtggtca atcacacctc tcatgtaccc 160920 agcctccttt gtgttcaaga tccccagcac agcctatgtg gtgctcacca gcgtgaacct 160980 cttcattggc attaatggca gcgtggccac ctttgtgctg gagctgttca ccgacaatgt 161040 gagtcatgca gagagaacac tcctgctggg atgagcatct ctgggagcca gaggacagtg 161100 tttaattgtg atcttattcc acttgtcagt ggtattgaca ctgctgactg ccttgtcctg 161160 tcttcagagt ctgtcttccc tgagaaggca aagcaccttt ctttcttgct gtgccttaca 161220 ttttgctggt caagcctttc agtttctttt gacagttttt tttacttctt tcttttttca 161280 atgttgctct taccaagagt agctcctctg ccttccactt tacacatgag agctgggcga 161340 cgncattcag tcctaaggct tttaccatca cctctcttgg tgtttttatt gtcatctcta 161400 agatcaatgc ctttagcctt gatcataacc ttgaactcta atctcaaatt ctcacttgcc 161460 tagtggattg ctccatttag atagtatata gataccccaa cctggatatg tcctagtttt 161520 ctttcccctt ggaacttaat gcttttcttg ccatccctgt cacactcagt ggcactacca 161580 tccactcggt tgcccaagct ggctcttaga gttatcctag atgcttgctt tgctgttgca 161640 gatttcccac attcaactgg ttatgttgtc agttcttcca ggtatggacc tctaaaataa 161700 ggcttcctct ccattccggt tgtcattgcc tttgtccaaa cacagcacac aaggcctttt 161760 acagttgcac aactcttcct gtccataccc accacaccct ttcccagctg taagcttcag 161820 atgagttgcc tccaaccacc atgctcctgt aggcctggct tgaaatgccc ttcttctgtc 161880 acagggtctg gtagtatatc ccttgccctt caagatttag ctaaaatgtg aagctttcct 161940 tacctgctgg gaggtgttct ctcttttctc tgtgctctca gagtccttag tccatgcctc 162000 cagtacaacg tacatccact tacatggtaa tttcctgttt acatactttt cctactcgga 162060 gtggagtctg tttcttaata attttgcctc tcccatgccc tagcacagtg catccagcgt 162120 atagcccctt attcagttgg tagatatttg gccactgttg ccttgtggga tcataagttc 162180 tgatgtattt gagaagaatt tctaaaattc tgacaaaatc ctgaaactca aatattgacc 162240 cagacatgag caatttgctt ttcaaatgct aagggatttt taatggattt gctttaatta 162300 aatctagcct gtttctaagc tttattcatt atttctccat actcagagca tttctccaga 162360 ttttctaaag aatagaattt tattgctaca tatcatcagc tatgcctgct gctatttaat 162420 tggtatctga attaaaaggt ctggtttgtc cctagagaat caaatttttt cttcactccc 162480 atatttcaga acttgataca tttttaggat aaaccatgaa tgacacccgt ttcttctccc 162540 tcaccctccc ttccctccca tttttttttt tttttttttt tagaagctga ataatatcaa 162600 tgatatcctg aagtccgtgt tcttgatctt cccacatttt tgcctgggac gagggctcat 162660 cgacatggtg aaaaaccagg caatggctga tgccctggaa aggtttggtg agtgaagcag 162720 tggctgtagg atgctttaat ggagatggca ctctgcatag gccttggtac cctgaacttt 162780 gttttggaaa gaagcaggtg actaagcaca ggatgttccc ccacccccat gcccagtgac 162840 agggctcatg ccaacacagc tggttgtggc atgggttttg tgacacaacc atttgtctgt 162900 gtctctgata gcattgagaa aagtgaaagg gcagttttga aggtaaggaa aatagtgtta 162960 tttgcttgga tccactggct catgccactg tctgggttgg ttagaagcac tggaaaagtc 163020 aaaccataac tttgagaatt aggtgatcag ggaatcagaa ggaaagatgc aaactttggc 163080 tcttttaggc gaatcatgtg cctgcagatg aggtcattta ttatctttta cacagtctat 163140 aaaattataa tgtattacat ctttttctac ctttagaatg gttaaaaata tttctccggt 163200 agccatatga ttattattca tccattagat aatatagtca aatgggccat gttatttact 163260 gttcatagaa gaggggcttt ttgcaacttg ggctacaaag gagatatgta aggaatttaa 163320 ggaatggtta catggaacta gatttaattg aatctagtgg tttaattgat tcactaggat 163380 atatgctact gaaaggggaa tctgcttaaa gtgctttctg atatttatta ttactaaaac 163440 ttagaattta ttaaaaatac tgactgtgaa aattacttgg gtcgtttgcc tttttaaaag 163500 gatttttggc atgtctcatt aaaaaaagaa atactagata tcttcagtga agttacaaat 163560 cgaatacaca ttggctctga aattctgatt gatactgggt cataaaaagt tttcccaaat 163620 cagacttgga aagtgatcac tctcttgtta ctcttttttc cttgtcatgg gtgatagcca 163680 tttgtgttta ttggaagatc ggtgaatttt aaggaacata ggcccaaatt tgaggaaggg 163740 ccatggtttt tgatccctcc attctgaccg gatctctgca ttgtgtctac taggggagaa 163800 tcgctttgtg tcaccattat cttgggactt ggtgggacga aacctcttcg ccatggccgt 163860 ggaaggggtg gtgttcttcc tcattactgt tctgatccag tacagattct tcatcaggcc 163920 caggtgagct ttttcttaga acccgtggag cacctggttg agggtcacag aggaggcgca 163980 cagggaaaca ctcaccaatg ggggttgcat tgaactgaac tcaaaatatg tgataaaact 164040 gattttcctg atgtgggcat cccgcagccc cctccctgcc catcctggag actgtggcaa 164100 gtaggtttta taatactacg ttagagactg aatctttgtc ctgaaaaata gtttgaaagg 164160 ttcatttttc ttgttttttc ccccaagacc tgtaaatgca aagctatctc ctctgaatga 164220 tgaagatgaa gatgtgaggc gggaaagaca gagaattctt gatggtggag gccagaatga 164280 catcttagaa atcaaggagt tgacgaaggt gagagagtac aggttacaat agctcatctt 164340 cagttttttt cagctttatg tgctgtaacc cagcagtttg ctgacttgct taataaaagg 164400 gcatgtgttc ccaaaatgta catctatacc aaggttctgt caattttatt ttaaaaacac 164460 catggagact tcttaaagaa ttcttactga gaattctttt gtgatatgaa ttcccattct 164520 cgaatacatt ggttttatat gcttacattt atgtgttagt tattaaaaca tactaatatt 164580 gtatatctag tcaaaactga ggtagagaga ataaatggtt gattttgagt ttgagtttca 164640 tagtccaaaa agctgatata ttgcctgtgt tcaagagggt ctatatcagc cctctagatg 164700 ccagcatctc caaattttac ttttttggaa tctgtacagt atttgcaata tttttattac 164760 aaatttctac tctgtggaat ttaattttta aaatacctgc aatacatata tatgttgaat 164820 agatgaaaaa ttatgtagat rataatgaat gatacggttc taaaaagaca ggttaaaaag 164880 taagttcact tttattttga gcttcagaat cattcagaag ccagtcgcca caaacgcaga 164940 ccaaggctct tggcacatca aatatgccta tggcttaggg ttattgacaa gtcttatgtt 165000 gcagtgtatg tggtttatag tcctgccttc cacagttgct tgggagagct gtgagtcact 165060 gaggcttatg aatgtttaca ttttgtttgt tgcagatata tagaaggaag cggaagcctg 165120 ctgttgacag gatttgcgtg ggcattcctc ctggtgaggt aaagacactt tgtctatatt 165180 gcgtttgtcc ctattagttc agactatctc tacccaatca agcaacgatg ctcgttaaga 165240 ggtaaaagtg gattttaaag gcttctgtat ttatgccagg atggagcaat tagtcatcga 165300 gaagagaggg accctgtatg tcaagagaat gatttcagag aatccaatac aatttaagaa 165360 aaagcatggg gctgggcgca gtgattcact cctgtaatcc cagcactttg ggaggccgag 165420 gtgggcggac tcacgaggtc aggagattga gaccatcctg gccaacatgg tgaaacccca 165480 tctctactat aaatacaaaa attagctggg catagtagtg cattcctgta gtcccagcta 165540 ctcgggaggc tgaggcagga gaattgcttg aacctaggag ggggaggttg cccagattgc 165600 gctgctgcac tccagcctgg tgacagagtg agactcatgt caacaacaaa aacagaaaaa 165660 gcacgcacat ctaaaacatg cttttgtgat ccatttggga tggtgatgac attcaaatag 165720 ttttttaaaa atagattttc tcctttctgg tttccgtttg tgttctttta tgcccttttg 165780 ccagagtagg tggtgcnaat ttggctangc tggctttcat tactgttttt cacnacnatt 165840 aacntttggc ctcaacttga caactcaaat aatatttata aatacagcca cacttaaaat 165900 ggtcccatta tgaaatacat atttaaatat ctatacgatg tgttaaaacc aagaaaatat 165960 ttgattcttc tctgatattt aagaattgaa ggtttgaggt agttacgtgt taggggcatt 166020 tatattcatg tttttagagt ttgcttatac aacttaatct ttccttttca gtgctttggg 166080 ctcctgggag ttaatggggc tggaaaatca tcaactttca agatgttaac aggagatacc 166140 actgttacca gaggagatgc tttccttaac aaaaataggt gagaaaagaa gtggcttgta 166200 ttttgctgca aagactttgt ttttaattta tttaaagaaa taggttgtta tttttgatta 166260 cagtggtatt tttagagttc ataaaaatgt tgaaatatag taaagggtaa agaagcacat 166320 aaaatcatcc atgatttcaa tatctagaga taatcacaat ttacatttcc tttcagtctc 166380 attctcttct tttaacagct ttattcaggt ataatttaca tacaatataa tttgcttgtt 166440 ttttaagagt ataatttagt gatttttggt aaattgagag ttttgcaacc atcaccacaa 166500 tccagtttta gaacttttcc atcaccccac atctgtctta tatacacata taaatgtgcc 166560 atacaattga gatcatactg tatgtagaat ttaaaattag tttttattgt taatgagtgt 166620 attatgaata tttcccagtg ggttacattt cctaagatgt ggaattttac attgctacat 166680 aaaatccccc tatgtacatg tacctataat ttatttaata aattccttat aaatgttgga 166740 cacattagtt tccatttttc actatgtaaa tatgtccctg tatacatctt ttattatttc 166800 ctcaggaaca attcctacaa agtaaattgc cctctctaaa gagcatacaa attgactgag 166860 ccaccgttag gccattttct gagactgcac aggtcacaaa gcaatctgat ctttgggaat 166920 acagctacat tttataggct tcttagataa tgttactcta agtactttaa atatgtgggg 166980 cttctctggg cttttttttt tttgagacgg agtttcactc ttactgccca ggctggagag 167040 caatggcgcg accttggctc actgcaacct ccgcctccca ggttcaagcg attctcctgc 167100 ctcagcctcc tgagtagctg agattacagg tgcccgccac aatgcctgcc taattttttt 167160 gtattttcag tagagatggg gtttcaccat gttggccaga ctggtctcga gctcctgacc 167220 tcaggtgatc cacctgcctc agcctcccaa agttctggga ttacaggcat gagccactgc 167280 gcccggcttc tctggactta ttatgtggag agatagtaca aggcagtggc tttcagagtt 167340 ttttgaccat gaccgttgtg ggaaatacat tttatatctc aacctagtat gtacacacag 167400 acatgtagac acatgtataa cctaaagttt cataaagcag tacctactgt tactaattgt 167460 agtgcactct gctatttctt attctacctt atactgcgtc attaaaaaag tgctggtcat 167520 gacccactaa atttatttcc caaaccacta atgaacaatg actcacaatt tgaacacact 167580 ggacaggggg atagccaata aaattgaaaa gagcaaggaa attaatgtat tcatgatctc 167640 ctctcctgtc tcttacattt ttgcagtagc aatgtaaagg aatcctaaga gaacagacat 167700 tctgggaata gcaggcctag cgctgcacaa ctgctttcct aggcttgctc ctagtaccaa 167760 gctcctgacg catatagcag tggcagtaat aaccagccca tagtaaggtt tgtcacaggg 167820 actggttgta agaactgatt tggttggtat agctgtgagg gcctggcacg gtgtccacgt 167880 gtgcctcaat cctaattctg aaaaaggctg accctggggg tgctaattag atacacagag 167940 aggaatgaat gctgccagaa ggccaagttc atggcaatgc cgctgtggct gaggtgcagt 168000 catcagtctg gaacgtgaac actgaacttc tctcacatgt gattcttcac ttgactggct 168060 tcatagaacc ccaaagccac cccaccacca cataaattgt gtctctaggt tctgtgttgc 168120 tcacactcaa aatttctggg ccttctcatt tggtgcatgt gaatggtgca tatgagtgaa 168180 gtctaggatg gggccttagc gttaaagccc tggggtagtg tgactgagat tgttggtaaa 168240 gaatgtgcag tggttggcat gacctcagaa attctgaaat gggactgcac ctgcagactg 168300 aagtgttcag agagccaggg aggtgcaagg actggggagg gtagaggcag gaaccctgcc 168360 tgccaggaag agctagcatc ctgggggcag aaaggctgtg ctttcaagta gcagcagatg 168420 tattggtatc tttgtaatgg agaagcatac tttacaggaa cattaggcca gattgtctaa 168480 ccagagtatc tctacctgct taaaatctaa gtagttttct tgtcctttgc agtatcttat 168540 caaacatcca tgaagtacat cagaacatgg gctactgccc tcagtttgat gccatcacag 168600 agctgttgac tgggagagaa cacgtggagt tctttgccct tttgagagga gtcccagaga 168660 aagaagttgg caaggtactg tgggcacctg aaagccagcc tgtctccttt ggcatcctga 168720 caatatatac cttatggctt ttccacacgc attgacttca ggctgttttt cctcatgaat 168780 gcagcagcac aaaatgctgg ttctttgtat ctgctttcag ggtggaaacc tgtaacggtg 168840 gtggggcagg gctgggtggg cagagaggga gtgctgctcc caccacacga gtcccttctc 168900 cctgctttgg ctcctcacca gttgtcaggt tatgattata gaatctagtc ctactcagtg 168960 aaagaacttt catacatgta tgtgtaggac agcatgataa aattcccaag ccagaccaaa 169020 gtcaaggtgc tttttatcac tgtaggttgg tgagtgggcg attcggaaac tgggcctcgt 169080 gaagtatgga gaaaaatatg ctggtaacta tagtggaggc aacaaacgca agctctctac 169140 agccatggct ttgatcggcg ggcctcctgt ggtgtttctg gtgagtataa ctgtggatgg 169200 aaaactgttg ttctggcctg agtggaaaac atgactgttc aaaagtccta tatgtccagg 169260 gctgttgtat gattggcttg tcttccccca gggacagcag agcaaccttg gaaaagcaga 169320 gggaagcttc tcccttggca cacactgggg tggctgtacc atgcctgcag atgctcccaa 169380 atagaggcac tccaagcact ttgtttctta gcgtgattga ggctggatat gtgatttgat 169440 ctttctctgg aacattcttt ctaatcatct ttgtgttcat tccctgaaaa tgaagagtgt 169500 ggacacagct ttaaaatccc caaggtagca actaggtcat agttcctnta cacacggata 169560 gatgaaaaac agatcagact gggaagtggc ccttgacctt ttttcttctg tagataagag 169620 cattgatgtt attacgggaa gaagcctttg aggcttttat gtattccacc tcggtctgga 169680 atttgtttct gtaaggctaa cagttgcaat atactagggt aatctgagtg agctggaatt 169740 aaaaaaaaaa aggaatttca ccccaatctt atactgactt caatagaggt ttcagacaaa 169800 aagttgtttt gtatatactt atcagtcatg aaaagataat tacaactaaa tggccttttt 169860 ccttccctat ttatttggag aaatttaatt acataaaaaa gtactcagaa tatttgagtt 169920 tcctgcatca ataagacatt tataataatg accttgttta caaatgaatt tgaaagttac 169980 tctaattctt tgattcatca agaaataact agaatggcaa gttaaaattt aagctgtttc 170040 aaagatgctt ctgcatttaa aaacaaattt atctttgatt ttttttcccc ccagcaaata 170100 agacttattt tattctaatt acaggatgaa cccaccacag gcatggatcc caaagcccgg 170160 cggttcttgt ggaattgtgc cctaagtgtt gtcaaggagg ggagatcagt agtgcttaca 170220 tctcataggt ccgtagtaaa gtcttgggtt cctcactgtg ggatgtttta actttccaag 170280 tagaatatgc gatcattttg taaaaattag aaaatacaga aaagcaaaga gtaaaacaat 170340 tattacctga aattatatat gcatattctt acaaaaatgc aagcccagta taaatactgc 170400 tctttttcac ttaatatatt gtaaacatta ttccaagtca gtgcatttag gtgtcatttc 170460 ttatagctgg atagtattcc attaggatat actcttattt aactattccc ccttttgtag 170520 acatttggat tatttccaac ttgttcacaa ttgtaaacac cactacactg aacagcatca 170580 tccctatatc cacatgtact tgtaacagaa tacaattccc taggaagctg gaatgctgga 170640 agtcatggtg atgttctcat ggttacagag aatctctcta aaactaaaac ctctttctgt 170700 tttaccgcag tatggaagaa tgtgaagctc tttgcactag gatggcaatc atggtcaatg 170760 gaaggttcag gtgccttggc agtgtccagc atctaaaaaa taggtaataa agataatttc 170820 tttgggatag tgcctagtga gaaggcttga tatttattct tttgtgagta tataaatggt 170880 gcctctaaaa taaagggaaa taaaactgag caaaacagta tagtggaaag aatgagggct 170940 ttgaagtccg aactgcattc aaattctgtc tttaccattt actggttctg tgactcttgg 171000 gcaagttact taactactgt aagagttagt ttccctggaa gatctacctc ctagctttgt 171060 gctatagatg aaatgaaaaa aatttacatg tgccagtact ggtgagagcg caagctttgg 171120 agtcaaacac aaatgggttt gcatcctggc cctaccaatt atgagctctg agccatgggc 171180 aagtgactaa ctccctgggc ctcagtttct ctgtaacatc tgtcagactt catgggtcca 171240 ggtgaggatt aaaggagatc atgtatttac agcacatggc atggtgcttc acataaaata 171300 agtatttagt aaatgataac tggttccttc tctcagaaac ttatttctgg gcctgccagg 171360 ggccgccctt tttcatggca caagttgggt tcccagggtt cagtattctt ttaaatagtt 171420 ttctggagat cctccatttg ggtatttttt cctgctttca ggtttggaga tggttataca 171480 atagttgtac gaatagcagg gtccaacccg gacctgaagc ctgtccagga tttctttgga 171540 cttgcatttc ctggaagtgt tctaaaagag aaacaccgga acatgctaca ataccagctt 171600 ccatcttcat tatcttctct ggccaggata ttcagcatcc tctcccagag caaaaagcga 171660 ctccacatag aagactactc tgtttctcag acaacacttg accaagtaag ctttgagtgt 171720 caaaacagat ttacttctca gggtgtggat tcctgccccg acactcccgc ccataggtcc 171780 aagagcagtt tgtatcttga attggtgctt gaattcctga tctactattc ctagctatgc 171840 tttttactaa acctctctga acctgaaaag ggagatgatg cctatgtact ctataggatt 171900 attgtgagaa tttactgtaa taataaccat aaaaactacc atttagtgag cacctaccat 171960 gggccaggca ttttacttgg tgcctaatcc tatttaaatt agataaaaaa gtaccaaata 172020 ggtcctgaca cttaagaagt actcagtaaa tattttcttc cctcttccct ttaatcaaga 172080 ccgtatgtgc caaagtaaat ggatgactga gcagttggtg atgtaggggt ggggggcgat 172140 atagaaagtc agtttttggc cgggcgtggt ggctcatgcc tgtaatccca gcactttggg 172200 aggctgagga gcaggcagat catgaggtca ggagatccag ataatcctgg ccaacagggt 172260 gaaaccccgt ctctactaaa aatacaaaaa ttagctgggc atggtggtgc gcacttgtag 172320 tcccagctac ttgcgaggct gaggcaggag aattgctcga acccaggagg tggaggttac 172380 agtgagccaa ggtctcgcca ctgcactcca gcctggggac agagcaagac cccatttcaa 172440 ggggggaaaa aaagtctatt tttaagttgt tattgctttt ttcaagtatt cttccctcct 172500 tcacacacag ttttctagtt aatccattta tgtaattctg tatgctccta cttgacctaa 172560 tttcaacatc tggaaaaata gaactagaat aaagaatgag caagttgagt ggtatttata 172620 aaggtccatc ttaatctttt aacaggtatt tgtgaacttt gccaaggacc aaagtgatga 172680 tgaccactta aaagacctct cattacacaa aaaccagaca gtagtggacg ttgcagttct 172740 cacatctttt ctacaggatg agaaagtgaa agaaagctat gtatgaagaa tcctgttcat 172800 acggggtggc tgaaagtaaa gaggaactag actttccttt gcaccatgtg aagtgttgtg 172860 gagaaaagag ccagaagttg atgtgggaag aagtaaactg gatactgtac tgatactatt 172920 caatgcaatg caattcaatg caatgaaaac aaaattccat tacaggggca gtgcctttgt 172980 agcctatgtc ttgtatggct ctcaagtgaa agacttgaat ttagtttttt acctatacct 173040 atgtgaaact ctattatgga acccaatgga catatgggtt tgaactcaca cttttttttt 173100 tttttttgtt cctgtgtatt ctcattgggg ttgcaacaat aattcatcaa gtaatcatgg 173160 ccagcgatta ttgatcaaaa tcaaaaggta atgcacatcc tcattcacta agccatgcca 173220 tgcccaggag actggtttcc cggtgacaca tccattgctg gcaatgagtg tgccagagtt 173280 attagtgcca agtttttcag aaagtttgaa gcaccatggt gtgtcatgct cacttttgtg 173340 aaagctgctc tgctcagagt ctatcaacat tgaatatcag ttgacagaat ggtgccatgc 173400 gtggctaaca tcctgctttg attccctctg ataagctgtt ctggtggcag taacatgcaa 173460 caaaaatgtg ggtgtctcca ggcacgggaa acttggttcc attgttatat tgtcctatgc 173520 ttcgagccat gggtctacag ggtcatcctt atgagactct taaatatact tagatcctgg 173580 taagaggcaa agaatcaaca gccaaactgc tggggctgca agctgctgaa gccagggcat 173640 gggattaaag agattgtgcg ttcaaaccta gggaagcctg tgcccatttg tcctgactgt 173700 ctgctaacat ggtacactgc atctcaagat gtttatctga cacaagtgta ttatttctgg 173760 ctttttgaat taatctagaa aatgaaaaga tggagttgta ttttgacaaa aatgtttgta 173820 ctttttaatg ttatttggaa ttttaagttc tatcagtgac ttctgaatcc ttagaatggc 173880 ctctttgtag aaccctgtgg tatagaggag tatggccact gccccactat ttttattttc 173940 ttatgtaagt ttgcatatca gtcatgacta gtgcctagaa agcaatgtga tggtcaggat 174000 ctcatgacan ttatatttga gtttctttca gatcatttag gatactctta atctcacttc 174060 atcaatcaaa tattttttga gtgtatgctg tagctgaaag agtatgtacg tacgtataag 174120 actagagaga tattaagtct cagtacactt cctgtgccat gttattcagc tcactggttt 174180 acaaatatag gttgtcttgt ggttgtagga gcccactgta acaatactgg gcagcctttt 174240 tttttttttt tttaattgca acaatgcaaa agccaagaaa gtataagggt cacaagtcta 174300 aacaatgaat tcnttcaaca gggaaaacag ctagcttgaa aacttgctga aaaacacaac 174360 ttgtgtttat ggcatttagt accttcaaat aattggcttt gcagatattg gataccccat 174420 taaatctgac agtctcaaat ttttcatctc ttcaatcact agtcaagaaa aatataaaaa 174480 caacaaatac ttccatatgg agcatttttc agagttttct aacccagtct tatttttcta 174540 gtcagtaaac atttgtaaaa atactgtttc actaatactt actgttaact gtcttgagag 174600 aaaagaaaaa tatgagagaa ctattgtttg gggaagttca agtgatcttt caatatcatt 174660 actaacttct tccacttttt ccagaatttg aatattaacg ctaaaggtgt aagacttcag 174720 atttcaaatt aatctttcta tattttttaa atttacagaa tattatataa cccactgctg 174780 aaaaagaaaa aaatgattgt tttagaagtt aaagtcaata ttgattttaa atataagtaa 174840 tgaaggcata tttccaataa ctagtgatat ggcatcgttg cattttacag tatcttcaaa 174900 aatacagaat ttatagaata atttctcctc atttaatatt tttcaaaatc aaagttatgg 174960 tttcctcatt ttactaaaat cgtattctaa ttcttcatta tagtaaatct atgagcaact 175020 ccttacttcg gttcctctga tttcaaggcc atattttaaa aaatcaaaag gcactgtgaa 175080 ctattttgaa gaaaacacaa cattttaata cagattgaaa ggacctcttc tgaagctaga 175140 aacaatctat agttatacat cttcattaat actgtgttac cttttaaaat agtaattttt 175200 tacattttcc tgtgtaaacc taattgtggt agaaattttt accaactcta tactcaatca 175260 agcaaaattt ctgtatattc cctgtggaat gtacctatgt gagtttcaga aattctcaaa 175320 atacgtgttc aaaaatttct gcttttgcat ctttgggaca cctcagaaaa cttattaaca 175380 actgtgaata tgagaaatac agaagaaaat aataagccct ctatacataa atgcccagca 175440 caattcattg ttaaaaaaca accaaacctc acactactgt atttcattat ctgtactgaa 175500 agcaaatgct ttgtgactat taaatgttgc acatcattca ttcactgtat agtaatcatt 175560 gactaaagcc atttgtctgt gttttcttct tgtggttgta tatatcaggt aaaatatttt 175620 ccaaagagcc atgtgtcatg taatactgaa ccactttgat attgagacat taatttgtac 175680 ccttgttatt atctactagt aataatgtaa tactgtagaa atattgctct aattcttttc 175740 aaaattgttg catccccctt agaatgtttc tatttccata aggatttagg tatgctatta 175800 tcccttctta taccctaaga tgaagctgtt tktgtgctct ttgttcatca ttggccctca 175860 ttccaagcac tttacgctgt ctgtaatgsg atctattttt gcactggaat atctgagaaa 175920 ttgmamaact agacaaaagt ttcacaacag atttctaagt taaatcattt tcatwaaarr 175980 raaaararaa aaaaaatttt gtatgtcaat aactttatat gaagtattaa aatgcatatt 176040 tctatgttgt aatataatga gtcacaaaat aaagctgtga cagttctgtt ggtctacaga 176100 aatttacttt tgtgcatttg tggcaccacc tactgttgaa gggttataaa gccattagaa 176160 aagtagaggg gaagtgattt ggatcaaaag gaaaaacttt agaaaagatt caaatgttcc 176220 cttaatcata aaagagaact gaggggacta cttgaaaata aaaggttgtt ttgtattttc 176280 atgttggtta agatactgag ntaacwggta ttaagtgtta gaggttttta gataaatatt 176340 ctgcttaatg attatgaagc tgcactgaga tttctgaaaa tgctctgtag ctgagcttat 176400 ttaataaatg ttcacttggt ataggggaag ctacaaaggc agccttcagt gtccttttgt 176460 ttattcaacc aaaaatataa ggacacaatg tagcagttat actgggaagg tgctgggggt 176520 ggtggcaatg gtgagcagga aggcgaagta gatatggaaa cagaaatgat actaatatcg 176580 gtgattcctt ccttttttcc tgtrataagt gctgtgcaga caacatatga gcagtgctga 176640 taaatgtaaa tgtattkttc atagctcatt aagaatcagt ttcagaaaga gatgtctgct 176700 tatttkgctr cttgaagaat ccctgtcaaa cagtcctttt saggaagtac aagaggctgt 176760 ctctatttgt gacctcagga atggctgtga cagtgtcgtg agcagtcctt ttcctgtggc 176820 acagatctga actttgtgtg cagaaaaatc ttggcttcaa gtgagccaag atgccccctg 176880 agcatcagca tcacaacttc atcctcctat cttgaagttc atgttatagt gactttaatg 176940 aaatcataga acactgtttc ttcgtgnaac aatgacgagg gagaggaaaa aactttattg 177000 aaaaataaaa aggcaggtaa tttagatgaa aatatgttac ccatgaggtt ttgtttttgc 177060 tttttgtttt tgtttttgag aaacagaatc tcgctctgtc gtccaggctg gagtgcagcg 177120 gcatgatctt ggctcactgc aacctccgcc tcccgggttc aagcgattct cctcagcttc 177180 ccaagtagct ggtactacag gcatgcgcca ccacaaccag ctaatttttg tatttttagt 177240 agagatgggg tttcactata cgttggccag gctggtctca aactcctgac ctaaggtgat 177300 ccttctgcct tgggctccca aagtgctggg attacaggca tgagccacct tgcctggccc 177360 tacccatgag ccttgactaa aacattcttc tatctgtaga aaagcccaaa agaacttttc 177420 cagattcaaa aaacttggca ctttgtaatg gtaatgttta cattaagtaa aaaaaaaaaa 177480 aaaacccact tagcttcagt tttcaagtgt ttactgtgtt gtcatgcact tcatttaatt 177540 ctcaacacct gccctatgag gtaaaaagta ccattttaca tatgagtaaa ttacagctca 177600 gtggataaga aactcgtcca aaggtacagg ttcagtcaag tggcagaggg ttctttttgt 177660 tgaagttagg tatcagttaa aattgacctt gtaaaatcac atcagcatca atatacatta 177720 atttaacaaa tatttattga actttactgt atgccagata cttctctagg tactaggggg 177780 tacaatgtag aagaaaatag aattcctgct ctaatgaatt tatatttcag tggtgaaaga 177840 tgatgtgtgg acaaacacat ctaatgtatt ttgacagcaa tgggtgctaa gaagaaaata 177900 agacatggta attggataac aatggagaga agtcagagat ggccttttct gaggagtgac 177960 atgctcagaa tcgaataaaa agagcaggag cagcctttca aggtgggtgc gaaggatatt 178020 cccggaagaa agaataagtg ttgcaaaagc cccagtggga aaaagcttgg caagctgagg 178080 tggtaaaaga gctatctgta ctaatgtctc ttgtactgag ttagcaagga caagagtggc 178140 agggggtgga actggagaga tagcagggac cacatctcac aggatctcgc cagccttttt 178200 aaggtatttg gattttattg ttagtgcaac aaggagccac tggagagttt taaacagtag 178260 tggtgtgacc tactgtttca gaaagaacac tggctactat gggaagaaag gacagtagga 178320 agactaatag ataatggtgg attgaactaa gatggtagca acagataggg agctatggta 178380 atgttcagta tccactttgg agatagatcc agcaggactt gctgacagac tgcatgtaag 178440 ggctgaggga aagagagcta gcaaggtgac ttctagtttg tgacctgaac taggtagatg 178500 gttgtgttaa aaatgcagtg ggtatttcag tctgaagctc aggggcaatg atggaaaata 178560 agataaatgt gtgcaagttg gctgggcgtg gtggtgcacg cctataatca cagcactttg 178620 ggaggccgag acgggtggat cacttgaggt caggcgtttg cgaccagcct aacatggtga 178680 aaccccatct ctaataaata caaaaaaatt agctgggcat ggtggtgcat gcctgtaatc 178740 caagctactt gggaggctga gacaagagaa tcgcttgcac ctgggaggcg gaggtttcag 178800 tgagctgaga tcgcaccatt gcactccagc ctgggcaaca agagcgaaac tccatctaaa 178860 aaaaaaagtg tgcaagttat tggtacgtat atatttaagc ctcgggactg aatgagatta 178920 cctagagaga atgaagatag ggaacattct tgagcctgag atcctataac atttagaaga 178980 gaagctagcc agggaaattg agagggactg gccagtgaga gagagtaaaa tctgaagagt 179040 ataatgccat atgttgaagc tcagagaatt attgtttcag gaaggcagaa gttgtcaatt 179100 atgtccaaca ttgctgagaa gatgagtaag gaatgtaaat gagtaaaggc aggaatggct 179160 attgaatttg ttaagacaga ggtccttggt gaccttccta agagtcattt cagcatgaca 179220 aggatgcaga cctaactgga atagattttc agggaagact gagaggcaag gaaggggaga 179280 ctgcaaatat atgcaaatct atatttttgc tttgaaggga aacagaaaac aggacaacag 179340 ctggtttaca gaacttttat ttggattctt tccttgtctg ttaataatat aggctgcacc 179400 tgggcatcat cgtgcccttc ctttttgatt ctgctgtttt gcttttaatc ctacaggcat 179460 ttcttctggc aggagccttg tacacactct gggtggttca tgaccaccac ggacttgccc 179520 tttcaaggca gaagcctgtg attcaaaaac ttccattcct tccaatcgac tattataaaa 179580 ctcattttgt ctagtgccca tactccttta ccttaggatc ctgggccaac tgtcactaaa 179640 gaaaacaaac catttcctcc tcctggggga atgtgcttct ttcaagaagg tgcttagaga 179700 atagacattc taggtgacag cagttatgaa agaggtgtct ctaatgaacc taacacttct 179760 tccaacatac tccaggttca ccctcctttg aagacagaaa gtactacaac ttcagttgcc 179820 accctagtca ggttcatagg atttactcag taggttaaga gatggtgctg agaatgaagt 179880 ccagagtttg agtctacaag tggcatttgc ttctcctttg ggacttctgt ggcagggtgt 179940 ggaaccaaat cagattggtt gcatatctgt ttctcttcct gaaaatacaa ttcaaatacc 180000 ttcatagtct gtgacagcaa cttataacca acagacaatg gtattatttt tatttattga 180060 tggcactccc ccatagccag aacttctact gagtcattca tcgtgtattc caggaatact 180120 ttccaaatgg agtagtgagg cacttccaat aacagttatg taatctgtta acagtggctc 180180 aaaatgacat gtgaaggaat gatgtacttc ttttttcaga ctcacagatc tcatatgtta 180240 attcagtcaa caagtactta gggcttatca catgccaggc acaggtaata tgacaataat 180300 gaaaacaaaa acccttgctg tcatagaatt cggtttctag tgggaagaga catacattaa 180360 caaatcatat ataagtaaaa cattaattat ggtgataaca gaaaaagaaa gcaggggaga 180420 gggatagtgt cttagttcag gttgctgtaa caagagtacc atagacaggg tggcttamac 180480 aacaaacatt tctttctcac agatctagtg gctgggaagt ctaagaccag ggtgccagca 180540 tgaagaagtt ctggtgaagg cttacattct ggtttcctca cagtgagggc tcttcacgcc 180600 cttataaatg cactaattct attacccaca tgacctcatc taaaactaac tcacaaaggc 180660 caaacctcct atgttgagat taaggcttca acataggaat tttgggggga caaaaacatt 180720 cagtctctag cagatagggg ccactgtggg aggggattac aatcttaaat agggttgtca 180780 gggaaggctt catgagggca aagacctgag gaggcatgga aatgatccac gcagatgcat 180840 gtctggggaa gagcattcca ggcagaggca gtgggaatgg accctcaggt aagagcatgt 180900 ttagcaagtt cagagaaaag cctcttaggt caggtctgga gcgagagtgg aaaatgtaac 180960 aggacagatc atgtaaagac ttgtagacca cagttagagg cttggacaat aactctgagg 181020 atgacatgaa ggtattggga gggggtgttg agcagaggag tgatatgatc tggtttatat 181080 tttagattat tctgactgct atgttgaaaa tagggttgag agcagagggg aagggcagca 181140 caccagttaa gaccacttcc caaaaatcca gatgatgatg ggcttgaacc aggagagttg 181200 ctatagggat gtgaagcaat caaattctag atatttggga cctgctgaca ggtacataat 181260 ggtgtatgag agaaaaggca gtgtgagaat gactctgaag tctgtggcct gagcaaaaca 181320 gaaacctgga gttgccaata gctgagataa agtttctggt taaagatcaa gagctcagtt 181380 ttggacaaga ggggtgtgag atttctaata gacattcagg tggaggtgtg gggaacacac 181440 ctggatgtga tgagcctgga attcaaattc acatgtgtga atttgaaaaa agattcagaa 181500 aagaggtgtg gactggagga gctatccaca cacagatagc atttgaagcc acaagactgg 181560 gggtgatgac gcagggagta agtataaata gaaaagagaa gtggtccaag gactcagtcc 181620 ttacatcaga ggtcaaagag acaagatgga gactaggaga tatgaaggct tatgcagtga 181680 gtgtttcgga gaaacgaaga aacttcccag aagaaaggag aggtcaactg ctaacaagtc 181740 aagtgaaatg agaactggga aatgagttga caagagcaat ttggctggcg tgatcagaga 181800 atgtttaaga gataatgtga gggaagaaaa tggaggcagg aagtatgaac tactcttttg 181860 agtcatattg taaaggaaaa aaatggggtg attttactct tttttttttt taaagatagg 181920 aaaaaaaagt atatttatat gctgaggaga agaatccagt tgaaggggaa tactgtggct 181980 tcaggagaca ggaaaactgc ctaagccaca tccttcagca agtgagtgtt caagtggagg 182040 gtttgtcttt gcttggggca cgggcggttc atccacagtg aaaggagaga aggcagggca 182100 tgtggggttg gctgtagaaa gcgggcagac ggggtgggaa cttatgaaga ctcttctgat 182160 tgttattttc tctgttaagt aaaaggaaag gtcattggct aagaatgaag atggacgcgc 182220 aggtgctggg agtcatcgaa gagaatgaaa gaatgaataa aatagagaaa aatacaatgc 182280 tattatataa acctatacat ttaaattact ttcctcttca aattagaccc cctggaaggc 182340 aaatctctgt catgttaagt tttaggaaaa gtcataatct acttggagct aagtatgaag 182400 aataaggtat atgatttaaa ccacataatt ctatttttgc ttaaaccggg gaagccttta 182460 aggtaacgta gagtgtaccc attaaaaaag ataaaaatgt cagcttcccc ttccctttaa 182520 atatagtaag gctcaaaggt aattgatgtt cataacccta aggtactata acagtctgaa 182580 gcaaaatatt aaatgtcttc tgtagtattt tagttaactt ttaattttat tggtaattta 182640 tgtgtagtta accacatact tgtgcctcaa ctatttcaca taaatgtgag atgtcttagg 182700 cctctttgaa ctctgcatgg aaaagaatgc tgaggaggta tgtctgttta agctgccact 182760 gcctgcctct cataaataat atctcacatt gcataatagc ccagataaac tgcagatcgc 182820 ttagagcctg ataaatgaga ggaaactgta cagtttgttt tccaaattgc ccttggacag 182880 ctaattcaca atttactatt ttatagctgt aagattttaa aaatataaaa taacatagct 182940 gtaaatacag attatcagaa gatagtctgt aaaaatgtaa atataatata gctgtaaata 183000 tacactatcg gaagtaccaa atctagtttt agtaggtaga ggcatcaagt aaaatggaaa 183060 tgaaagatgt ttcaaggaaa actccaggac atctgtggca gtactaaaga aaccttcctt 183120 ctcaggttcc cagcatgcta ttttattgat gtaacaacat tttcaataaa gttggtaaat 183180 tatcactata ttactgtctt atagcaacat agcaagagct ttagagatca tataagtata 183240 aaatgtgaat ttaaaaaaac aatgaatatg caggattttt attagggcaa gcgtttccat 183300 aaccataaat atttctttaa aacaaataaa tgtcccaaga tctctgttag tgatccaaac 183360 taagtagaaa ttagtaaaat taattataaa tgaacaattt cagcatataa accaacaagt 183420 cttttctaga tttttaacac tgtgacccaa ttgcattatt ttccaagtta gaatgactaa 183480 taatcaatga atgtaaaagc aataattaat acagatgaca ttgtactttt ccacagtaaa 183540 gaaataaaca atctaatatt tttataaatc ccattttata tcacaaaata acctttacta 183600 agcaaatttt tttaaaatct caggaactat agacatgatg aaaagataga tattttatat 183660 aaataattca aaaatactgt caggcaagga aatgtaaaat ccttatttga gtaaaagaaa 183720 atgctataaa gcaatgagtt atcaaaatac agaagaggta ttctaaaaca aatgaaaaac 183780 caagatgatg aaatagtgac aactacttct aatgtgtaac agatactgaa atgccaaggt 183840 gaaagtgaac tgaattattt cttaaagcag tggagaatat gtaactttca aaaatgcaag 183900 aagcacagca aattaactaa ttaacttacc acctccttca aataaaagcg agaacctctt 183960 gggagaattt aagcaccatt agcagacaca tcttagagc 183999 2 21 DNA homo sapien 2 ggaagtgttc taaaagagaa a 21 3 21 DNA homo sapien 3 agtaaagagg gactagactt t 21 4 21 DNA homo sapien 4 agtaaagagg aactagactt t 21 5 2261 PRT homo sapien 5 Met Ala Cys Trp Pro Gln Leu Arg Leu Leu Leu Trp Lys Asn Leu Thr 1 5 10 15 Phe Arg Arg Arg Gln Thr Cys Gln Leu Leu Leu Glu Val Ala Trp Pro 20 25 30 Leu Phe Ile Phe Leu Ile Leu Ile Ser Val Arg Leu Ser Tyr Pro Pro 35 40 45 Tyr Glu Gln His Glu Cys His Phe Pro Asn Lys Ala Met Pro Ser Ala 50 55 60 Gly Thr Leu Pro Trp Val Gln Gly Ile Ile Cys Asn Ala Asn Asn Pro 65 70 75 80 Cys Phe Arg Tyr Pro Thr Pro Gly Glu Ala Pro Gly Val Val Gly Asn 85 90 95 Phe Asn Lys Ser Ile Val Ala Arg Leu Phe Ser Asp Ala Arg Arg Leu 100 105 110 Leu Leu Tyr Ser Gln Lys Asp Thr Ser Met Lys Asp Met Arg Lys Val 115 120 125 Leu Arg Thr Leu Gln Gln Ile Lys Lys Ser Ser Ser Asn Leu Lys Leu 130 135 140 Gln Asp Phe Leu Val Asp Asn Glu Thr Phe Ser Gly Phe Leu Tyr His 145 150 155 160 Asn Leu Ser Leu Pro Lys Ser Thr Val Asp Lys Met Leu Arg Ala Asp 165 170 175 Val Ile Leu His Lys Val Phe Leu Gln Gly Tyr Gln Leu His Leu Thr 180 185 190 Ser Leu Cys Asn Gly Ser Lys Ser Glu Glu Met Ile Gln Leu Gly Asp 195 200 205 Gln Glu Val Ser Glu Leu Cys Gly Leu Pro Arg Glu Lys Leu Ala Ala 210 215 220 Ala Glu Arg Val Leu Arg Ser Asn Met Asp Ile Leu Lys Pro Ile Leu 225 230 235 240 Arg Thr Leu Asn Ser Thr Ser Pro Phe Pro Ser Lys Glu Leu Ala Glu 245 250 255 Ala Thr Lys Thr Leu Leu His Ser Leu Gly Thr Leu Ala Gln Glu Leu 260 265 270 Phe Ser Met Arg Ser Trp Ser Asp Met Arg Gln Glu Val Met Phe Leu 275 280 285 Thr Asn Val Asn Ser Ser Ser Ser Ser Thr Gln Ile Tyr Gln Ala Val 290 295 300 Ser Arg Ile Val Cys Gly His Pro Glu Gly Gly Gly Leu Lys Ile Lys 305 310 315 320 Ser Leu Asn Trp Tyr Glu Asp Asn Asn Tyr Lys Ala Leu Phe Gly Gly 325 330 335 Asn Gly Thr Glu Glu Asp Ala Glu Thr Phe Tyr Asp Asn Ser Thr Thr 340 345 350 Pro Tyr Cys Asn Asp Leu Met Lys Asn Leu Glu Ser Ser Pro Leu Ser 355 360 365 Arg Ile Ile Trp Lys Ala Leu Lys Pro Leu Leu Val Gly Lys Ile Leu 370 375 380 Tyr Thr Pro Asp Thr Pro Ala Thr Arg Gln Val Met Ala Glu Val Asn 385 390 395 400 Lys Thr Phe Gln Glu Leu Ala Val Phe His Asp Leu Glu Gly Met Trp 405 410 415 Glu Glu Leu Ser Pro Lys Ile Trp Thr Phe Met Glu Asn Ser Gln Glu 420 425 430 Met Asp Leu Val Arg Met Leu Leu Asp Ser Arg Asp Asn Asp His Phe 435 440 445 Trp Glu Gln Gln Leu Asp Gly Leu Asp Trp Thr Ala Gln Asp Ile Val 450 455 460 Ala Phe Leu Ala Lys His Pro Glu Asp Val Gln Ser Ser Asn Gly Ser 465 470 475 480 Val Tyr Thr Trp Arg Glu Ala Phe Asn Glu Thr Asn Gln Ala Ile Arg 485 490 495 Thr Ile Ser Arg Phe Met Glu Cys Val Asn Leu Asn Lys Leu Glu Pro 500 505 510 Ile Ala Thr Glu Val Trp Leu Ile Asn Lys Ser Met Glu Leu Leu Asp 515 520 525 Glu Arg Lys Phe Trp Ala Gly Ile Val Phe Thr Gly Ile Thr Pro Gly 530 535 540 Ser Ile Glu Leu Pro His His Val Lys Tyr Lys Ile Arg Met Asp Ile 545 550 555 560 Asp Asn Val Glu Arg Thr Asn Lys Ile Lys Asp Gly Tyr Trp Asp Pro 565 570 575 Gly Pro Arg Ala Asp Pro Phe Glu Asp Met Arg Tyr Val Trp Gly Gly 580 585 590 Phe Ala Tyr Leu Gln Asp Val Val Glu Gln Ala Ile Ile Arg Val Leu 595 600 605 Thr Gly Thr Glu Lys Lys Thr Gly Val Tyr Met Gln Gln Met Pro Tyr 610 615 620 Pro Cys Tyr Val Asp Asp Ile Phe Leu Arg Val Met Ser Arg Ser Met 625 630 635 640 Pro Leu Phe Met Thr Leu Ala Trp Ile Tyr Ser Val Ala Val Ile Ile 645 650 655 Lys Gly Ile Val Tyr Glu Lys Glu Ala Arg Leu Lys Glu Thr Met Arg 660 665 670 Ile Met Gly Leu Asp Asn Ser Ile Leu Trp Phe Ser Trp Phe Ile Ser 675 680 685 Ser Leu Ile Pro Leu Leu Val Ser Ala Gly Leu Leu Val Val Ile Leu 690 695 700 Lys Leu Gly Asn Leu Leu Pro Tyr Ser Asp Pro Ser Val Val Phe Val 705 710 715 720 Phe Leu Ser Val Phe Ala Val Val Thr Ile Leu Gln Cys Phe Leu Ile 725 730 735 Ser Thr Leu Phe Ser Arg Ala Asn Leu Ala Ala Ala Cys Gly Gly Ile 740 745 750 Ile Tyr Phe Thr Leu Tyr Leu Pro Tyr Val Leu Cys Val Ala Trp Gln 755 760 765 Asp Tyr Val Gly Phe Thr Leu Lys Ile Phe Ala Ser Leu Leu Ser Pro 770 775 780 Val Ala Phe Gly Phe Gly Cys Glu Tyr Phe Ala Leu Phe Glu Glu Gln 785 790 795 800 Gly Ile Gly Val Gln Trp Asp Asn Leu Phe Glu Ser Pro Val Glu Glu 805 810 815 Asp Gly Phe Asn Leu Thr Thr Ser Val Ser Met Met Leu Phe Asp Thr 820 825 830 Phe Leu Tyr Gly Val Met Thr Trp Tyr Ile Glu Ala Val Phe Pro Gly 835 840 845 Gln Tyr Gly Ile Pro Arg Pro Trp Tyr Phe Pro Cys Thr Lys Ser Tyr 850 855 860 Trp Phe Gly Glu Glu Ser Asp Glu Lys Ser His Pro Gly Ser Asn Gln 865 870 875 880 Lys Arg Ile Ser Glu Ile Cys Met Glu Glu Glu Pro Thr His Leu Lys 885 890 895 Leu Gly Val Ser Ile Gln Asn Leu Val Lys Val Tyr Arg Asp Gly Met 900 905 910 Lys Val Ala Val Asp Gly Leu Ala Leu Asn Phe Tyr Glu Gly Gln Ile 915 920 925 Thr Ser Phe Leu Gly His Asn Gly Ala Gly Lys Thr Thr Thr Met Ser 930 935 940 Ile Leu Thr Gly Leu Phe Pro Pro Thr Ser Gly Thr Ala Tyr Ile Leu 945 950 955 960 Gly Lys Asp Ile Arg Ser Glu Met Ser Thr Ile Arg Gln Asn Leu Gly 965 970 975 Val Cys Pro Gln His Asn Val Leu Phe Asp Met Leu Thr Val Glu Glu 980 985 990 His Ile Trp Phe Tyr Ala Arg Leu Lys Gly Leu Ser Glu Lys His Val 995 1000 1005 Lys Ala Glu Met Glu Gln Met Ala Leu Asp Val Gly Leu Pro Ser Ser 1010 1015 1020 Lys Leu Lys Ser Lys Thr Ser Gln Leu Ser Gly Gly Met Gln Arg Lys 1025 1030 1035 1040 Leu Ser Val Ala Leu Ala Phe Val Gly Gly Ser Lys Val Val Ile Leu 1045 1050 1055 Asp Glu Pro Thr Ala Gly Val Asp Pro Tyr Ser Arg Arg Gly Ile Trp 1060 1065 1070 Glu Leu Leu Leu Lys Tyr Arg Gln Gly Arg Thr Ile Ile Leu Ser Thr 1075 1080 1085 His His Met Asp Glu Ala Asp Val Leu Gly Asp Arg Ile Ala Ile Ile 1090 1095 1100 Ser His Gly Lys Leu Cys Cys Val Gly Ser Ser Leu Phe Leu Lys Asn 1105 1110 1115 1120 Gln Leu Gly Thr Gly Tyr Tyr Leu Thr Leu Val Lys Lys Asp Val Glu 1125 1130 1135 Ser Ser Leu Ser Ser Cys Arg Asn Ser Ser Ser Thr Val Ser Tyr Leu 1140 1145 1150 Lys Lys Glu Asp Ser Val Ser Gln Ser Ser Ser Asp Ala Gly Leu Gly 1155 1160 1165 Ser Asp His Glu Ser Asp Thr Leu Thr Ile Asp Val Ser Ala Ile Ser 1170 1175 1180 Asn Leu Ile Arg Lys His Val Ser Glu Ala Arg Leu Val Glu Asp Ile 1185 1190 1195 1200 Gly His Glu Leu Thr Tyr Val Leu Pro Tyr Glu Ala Ala Lys Glu Gly 1205 1210 1215 Ala Phe Val Glu Leu Phe His Glu Ile Asp Asp Arg Leu Ser Asp Leu 1220 1225 1230 Gly Ile Ser Ser Tyr Gly Ile Ser Glu Thr Thr Leu Glu Glu Ile Phe 1235 1240 1245 Leu Lys Val Ala Glu Glu Ser Gly Val Asp Ala Glu Thr Ser Asp Gly 1250 1255 1260 Thr Leu Pro Ala Arg Arg Asn Arg Arg Ala Phe Gly Asp Lys Gln Ser 1265 1270 1275 1280 Cys Leu Arg Pro Phe Thr Glu Asp Asp Ala Ala Asp Pro Asn Asp Ser 1285 1290 1295 Asp Ile Asp Pro Glu Ser Arg Glu Thr Asp Leu Leu Ser Gly Met Asp 1300 1305 1310 Gly Lys Gly Ser Tyr Gln Val Lys Gly Trp Lys Leu Thr Gln Gln Gln 1315 1320 1325 Phe Val Ala Leu Leu Trp Lys Arg Leu Leu Ile Ala Arg Arg Ser Arg 1330 1335 1340 Lys Gly Phe Phe Ala Gln Ile Val Leu Pro Ala Val Phe Val Cys Ile 1345 1350 1355 1360 Ala Leu Val Phe Ser Leu Ile Val Pro Pro Phe Gly Lys Tyr Pro Ser 1365 1370 1375 Leu Glu Leu Gln Pro Trp Met Tyr Asn Glu Gln Tyr Thr Phe Val Ser 1380 1385 1390 Asn Asp Ala Pro Glu Asp Thr Gly Thr Leu Glu Leu Leu Asn Ala Leu 1395 1400 1405 Thr Lys Asp Pro Gly Phe Gly Thr Arg Cys Met Glu Gly Asn Pro Ile 1410 1415 1420 Pro Asp Thr Pro Cys Gln Ala Gly Glu Glu Glu Trp Thr Thr Ala Pro 1425 1430 1435 1440 Val Pro Gln Thr Ile Met Asp Leu Phe Gln Asn Gly Asn Trp Thr Met 1445 1450 1455 Gln Asn Pro Ser Pro Ala Cys Gln Cys Ser Ser Asp Lys Ile Lys Lys 1460 1465 1470 Met Leu Pro Val Cys Pro Pro Gly Ala Gly Gly Leu Pro Pro Pro Gln 1475 1480 1485 Arg Lys Gln Asn Thr Ala Asp Ile Leu Gln Asp Leu Thr Gly Arg Asn 1490 1495 1500 Ile Ser Asp Tyr Leu Val Lys Thr Tyr Val Gln Ile Ile Ala Lys Ser 1505 1510 1515 1520 Leu Lys Asn Lys Ile Trp Val Asn Glu Phe Arg Tyr Gly Gly Phe Ser 1525 1530 1535 Leu Gly Val Ser Asn Thr Gln Ala Leu Pro Pro Ser Gln Glu Val Asn 1540 1545 1550 Asp Ala Ile Lys Gln Met Lys Lys His Leu Lys Leu Ala Lys Asp Ser 1555 1560 1565 Ser Ala Asp Arg Phe Leu Asn Ser Leu Gly Arg Phe Met Thr Gly Leu 1570 1575 1580 Asp Thr Arg Asn Asn Val Lys Val Trp Phe Asn Asn Lys Gly Trp His 1585 1590 1595 1600 Ala Ile Ser Ser Phe Leu Asn Val Ile Asn Asn Ala Ile Leu Arg Ala 1605 1610 1615 Asn Leu Gln Lys Gly Glu Asn Pro Ser His Tyr Gly Ile Thr Ala Phe 1620 1625 1630 Asn His Pro Leu Asn Leu Thr Lys Gln Gln Leu Ser Glu Val Ala Leu 1635 1640 1645 Met Thr Thr Ser Val Asp Val Leu Val Ser Ile Cys Val Ile Phe Ala 1650 1655 1660 Met Ser Phe Val Pro Ala Ser Phe Val Val Phe Leu Ile Gln Glu Arg 1665 1670 1675 1680 Val Ser Lys Ala Lys His Leu Gln Phe Ile Ser Gly Val Lys Pro Val 1685 1690 1695 Ile Tyr Trp Leu Ser Asn Phe Val Trp Asp Met Cys Asn Tyr Val Val 1700 1705 1710 Pro Ala Thr Leu Val Ile Ile Ile Phe Ile Cys Phe Gln Gln Lys Ser 1715 1720 1725 Tyr Val Ser Ser Thr Asn Leu Pro Val Leu Ala Leu Leu Leu Leu Leu 1730 1735 1740 Tyr Gly Trp Ser Ile Thr Pro Leu Met Tyr Pro Ala Ser Phe Val Phe 1745 1750 1755 1760 Lys Ile Pro Ser Thr Ala Tyr Val Val Leu Thr Ser Val Asn Leu Phe 1765 1770 1775 Ile Gly Ile Asn Gly Ser Val Ala Thr Phe Val Leu Glu Leu Phe Thr 1780 1785 1790 Asp Asn Lys Leu Asn Asn Ile Asn Asp Ile Leu Lys Ser Val Phe Leu 1795 1800 1805 Ile Phe Pro His Phe Cys Leu Gly Arg Gly Leu Ile Asp Met Val Lys 1810 1815 1820 Asn Gln Ala Met Ala Asp Ala Leu Glu Arg Phe Gly Glu Asn Arg Phe 1825 1830 1835 1840 Val Ser Pro Leu Ser Trp Asp Leu Val Gly Arg Asn Leu Phe Ala Met 1845 1850 1855 Ala Val Glu Gly Val Val Phe Phe Leu Ile Thr Val Leu Ile Gln Tyr 1860 1865 1870 Arg Phe Phe Ile Arg Pro Arg Pro Val Asn Ala Lys Leu Ser Pro Leu 1875 1880 1885 Asn Asp Glu Asp Glu Asp Val Arg Arg Glu Arg Gln Arg Ile Leu Asp 1890 1895 1900 Gly Gly Gly Gln Asn Asp Ile Leu Glu Ile Lys Glu Leu Thr Lys Ile 1905 1910 1915 1920 Tyr Arg Arg Lys Arg Lys Pro Ala Val Asp Arg Ile Cys Val Gly Ile 1925 1930 1935 Pro Pro Gly Glu Cys Phe Gly Leu Leu Gly Val Asn Gly Ala Gly Lys 1940 1945 1950 Ser Ser Thr Phe Lys Met Leu Thr Gly Asp Thr Thr Val Thr Arg Gly 1955 1960 1965 Asp Ala Phe Leu Asn Lys Asn Ser Ile Leu Ser Asn Ile His Glu Val 1970 1975 1980 His Gln Asn Met Gly Tyr Cys Pro Gln Phe Asp Ala Ile Thr Glu Leu 1985 1990 1995 2000 Leu Thr Gly Arg Glu His Val Glu Phe Phe Ala Leu Leu Arg Gly Val 2005 2010 2015 Pro Glu Lys Glu Val Gly Lys Val Gly Glu Trp Ala Ile Arg Lys Leu 2020 2025 2030 Gly Leu Val Lys Tyr Gly Glu Lys Tyr Ala Gly Asn Tyr Ser Gly Gly 2035 2040 2045 Asn Lys Arg Lys Leu Ser Thr Ala Met Ala Leu Ile Gly Gly Pro Pro 2050 2055 2060 Val Val Phe Leu Asp Glu Pro Thr Thr Gly Met Asp Pro Lys Ala Arg 2065 2070 2075 2080 Arg Phe Leu Trp Asn Cys Ala Leu Ser Val Val Lys Glu Gly Arg Ser 2085 2090 2095 Val Val Leu Thr Ser His Ser Met Glu Glu Cys Glu Ala Leu Cys Thr 2100 2105 2110 Arg Met Ala Ile Met Val Asn Gly Arg Phe Arg Cys Leu Gly Ser Val 2115 2120 2125 Gln His Leu Lys Asn Arg Phe Gly Asp Gly Tyr Thr Ile Val Val Arg 2130 2135 2140 Ile Ala Gly Ser Asn Pro Asp Leu Lys Pro Val Gln Asp Phe Phe Gly 2145 2150 2155 2160 Leu Ala Phe Pro Gly Ser Val Leu Lys Glu Lys His Arg Asn Met Leu 2165 2170 2175 Gln Tyr Gln Leu Pro Ser Ser Leu Ser Ser Leu Ala Arg Ile Phe Ser 2180 2185 2190 Ile Leu Ser Gln Ser Lys Lys Arg Leu His Ile Glu Asp Tyr Ser Val 2195 2200 2205 Ser Gln Thr Thr Leu Asp Gln Val Phe Val Asn Phe Ala Lys Asp Gln 2210 2215 2220 Ser Asp Asp Asp His Leu Lys Asp Leu Ser Leu His Lys Asn Gln Thr 2225 2230 2235 2240 Val Val Asp Val Ala Val Leu Thr Ser Phe Leu Gln Asp Glu Lys Val 2245 2250 2255 Lys Glu Ser Tyr Val 2260 6 7860 DNA homo sapien 6 gtccctgctg tgagctctgg ccgctgcctt ccagggctcc cgagccacac gctgggggtg 60 ctggctgagg gaacatggct tgttggcctc agctgaggtt gctgctgtgg aagaacctca 120 ctttcagaag aagacaaaca tgtcagctgt tactggaagt ggcctggcct ctatttatct 180 tcctgatcct gatctctgtt cggctgagct acccacccta tgaacaacat gaatgccatt 240 ttccaaataa agccatgccc tctgcaggaa cacttccttg ggttcagggg attatctgta 300 atgccaacaa cccctgtttc cgttacccga ctcctgggga ggctcccgga gttgttggaa 360 actttaacaa atccattgtg gctcgcctgt tctcagatgc tcggaggctt cttttataca 420 gccagaaaga caccagcatg aaggacatgc gcaaagttct gagaacatta cagcagatca 480 agaaatccag ctcaaacttg aagcttcaag atttcctggt ggacaatgaa accttctctg 540 ggttcctgta tcacaacctc tctctcccaa agtctactgt ggacaagatg ctgagggctg 600 atgtcattct ccacaaggta tttttgcaag gctaccagtt acatttgaca agtctgtgca 660 atggatcaaa atcagaagag atgattcaac ttggtgacca agaagtttct gagctttgtg 720 gcctaccaag ggagaaactg gctgcagcag agcgagtact tcgttccaac atggacatcc 780 tgaagccaat cctgagaaca ctaaactcta catctccctt cccgagcaag gagctggctg 840 aagccacaaa aacattgctg catagtcttg ggactctggc ccaggagctg ttcagcatga 900 gaagctggag tgacatgcga caggaggtga tgtttctgac caatgtgaac agctccagct 960 cctccaccca aatctaccag gctgtgtctc gtattgtctg cgggcatccc gagggagggg 1020 ggctgaagat caagtctctc aactggtatg aggacaacaa ctacaaagcc ctctttggag 1080 gcaatggcac tgaggaagat gctgaaacct tctatgacaa ctctacaact ccttactgca 1140 atgatttgat gaagaatttg gagtctagtc ctctttcccg cattatctgg aaagctctga 1200 agccgctgct cgttgggaag atcctgtata cacctgacac tccagccaca aggcaggtca 1260 tggctgaggt gaacaagacc ttccaggaac tggctgtgtt ccatgatctg gaaggcatgt 1320 gggaggaact cagccccaag atctggacct tcatggagaa cagccaagaa atggaccttg 1380 tccggatgct gttggacagc agggacaatg accacttttg ggaacagcag ttggatggct 1440 tagattggac agcccaagac atcgtggcgt ttttggccaa gcacccagag gatgtccagt 1500 ccagtaatgg ttctgtgtac acctggagag aagctttcaa cgagactaac caggcaatcc 1560 ggaccatatc tcgcttcatg gagtgtgtca acctgaacaa gctagaaccc atagcaacag 1620 aagtctggct catcaacaag tccatggagc tgctggatga gaggaagttc tgggctggta 1680 ttgtgttcac tggaattact ccaggcagca ttgagctgcc ccatcatgtc aagtacaaga 1740 tccgaatgga cattgacaat gtggagagga caaataaaat caaggatggg tactgggacc 1800 ctggtcctcg agctgacccc tttgaggaca tgcggtacgt ctgggggggc ttcgcctact 1860 tgcaggatgt ggtggagcag gcaatcatca gggtgctgac gggcaccgag aagaaaactg 1920 gtgtctatat gcaacagatg ccctatccct gttacgttga tgacatcttt ctgcgggtga 1980 tgagccggtc aatgcccctc ttcatgacgc tggcctggat ttactcagtg gctgtgatca 2040 tcaagggcat cgtgtatgag aaggaggcac ggctgaaaga gaccatgcgg atcatgggcc 2100 tggacaacag catcctctgg tttagctggt tcattagtag cctcattcct cttcttgtga 2160 gcgctggcct gctagtggtc atcctgaagt taggaaacct gctgccctac agtgatccca 2220 gcgtggtgtt tgtcttcctg tccgtgtttg ctgtggtgac aatcctgcag tgcttcctga 2280 ttagcacact cttctccaga gccaacctgg cagcagcctg tgggggcatc atctacttca 2340 cgctgtacct gccctacgtc ctgtgtgtgg catggcagga ctacgtgggc ttcacactca 2400 agatcttcgc tagcctgctg tctcctgtgg cttttgggtt tggctgtgag tactttgccc 2460 tttttgagga gcagggcatt ggagtgcagt gggacaacct gtttgagagt cctgtggagg 2520 aagatggctt caatctcacc acttcggtct ccatgatgct gtttgacacc ttcctctatg 2580 gggtgatgac ctggtacatt gaggctgtct ttccaggcca gtacggaatt cccaggccct 2640 ggtattttcc ttgcaccaag tcctactggt ttggcgagga aagtgatgag aagagccacc 2700 ctggttccaa ccagaagaga atatcagaaa tctgcatgga ggaggaaccc acccacttga 2760 agctgggcgt gtccattcag aacctggtaa aagtctaccg agatgggatg aaggtggctg 2820 tcgatggcct ggcactgaat ttttatgagg gccagatcac ctccttcctg ggccacaatg 2880 gagcggggaa gacgaccacc atgtcaatcc tgaccgggtt gttccccccg acctcgggca 2940 ccgcctacat cctgggaaaa gacattcgct ctgagatgag caccatccgg cagaacctgg 3000 gggtctgtcc ccagcataac gtgctgtttg acatgctgac tgtcgaagaa cacatctggt 3060 tctatgcccg cttgaaaggg ctctctgaga agcacgtgaa ggcggagatg gagcagatgg 3120 ccctggatgt tggtttgcca tcaagcaagc tgaaaagcaa aacaagccag ctgtcaggtg 3180 gaatgcagag aaagctatct gtggccttgg cctttgtcgg gggatctaag gttgtcattc 3240 tggatgaacc cacagctggt gtggaccctt actcccgcag gggaatatgg gagctgctgc 3300 tgaaataccg acaaggccgc accattattc tctctacaca ccacatggat gaagcggacg 3360 tcctggggga caggattgcc atcatctccc atgggaagct gtgctgtgtg ggctcctccc 3420 tgtttctgaa gaaccagctg ggaacaggct actacctgac cttggtcaag aaagatgtgg 3480 aatcctccct cagttcctgc agaaacagta gtagcactgt gtcatacctg aaaaaggagg 3540 acagtgtttc tcagagcagt tctgatgctg gcctgggcag cgaccatgag agtgacacgc 3600 tgaccatcga tgtctctgct atctccaacc tcatcaggaa gcatgtgtct gaagcccggc 3660 tggtggaaga catagggcat gagctgacct atgtgctgcc atatgaagct gctaaggagg 3720 gagcctttgt ggaactcttt catgagattg atgaccggct ctcagacctg ggcatttcta 3780 gttatggcat ctcagagacg accctggaag aaatattcct caaggtggcc gaagagagtg 3840 gggtggatgc tgagacctca gatggtacct tgccagcaag acgaaacagg cgggccttcg 3900 gggacaagca gagctgtctt cgcccgttca ctgaagatga tgctgctgat ccaaatgatt 3960 ctgacataga cccagaatcc agagagacag acttgctcag tgggatggat ggcaaagggt 4020 cctaccaggt gaaaggctgg aaacttacac agcaacagtt tgtggccctt ttgtggaaga 4080 gactgctaat tgccagacgg agtcggaaag gattttttgc tcagattgtc ttgccagctg 4140 tgtttgtctg cattgccctt gtgttcagcc tgatcgtgcc accctttggc aagtacccca 4200 gcctggaact tcagccctgg atgtacaacg aacagtacac atttgtcagc aatgatgctc 4260 ctgaggacac gggaaccctg gaactcttaa acgccctcac caaagaccct ggcttcggga 4320 cccgctgtat ggaaggaaac ccaatcccag acacgccctg ccaggcaggg gaggaagagt 4380 ggaccactgc cccagttccc cagaccatca tggacctctt ccagaatggg aactggacaa 4440 tgcagaaccc ttcacctgca tgccagtgta gcagcgacaa aatcaagaag atgctgcctg 4500 tgtgtccccc aggggcaggg gggctgcctc ctccacaaag aaaacaaaac actgcagata 4560 tccttcagga cctgacagga agaaacattt cggattatct ggtgaagacg tatgtgcaga 4620 tcatagccaa aagcttaaag aacaagatct gggtgaatga gtttaggtat ggcggctttt 4680 ccctgggtgt cagtaatact caagcacttc ctccgagtca agaagttaat gatgccatca 4740 aacaaatgaa gaaacaccta aagctggcca aggacagttc tgcagatcga tttctcaaca 4800 gcttgggaag atttatgaca ggactggaca ccagaaataa tgtcaaggtg tggttcaata 4860 acaagggctg gcatgcaatc agctctttcc tgaatgtcat caacaatgcc attctccggg 4920 ccaacctgca aaagggagag aaccctagcc attatggaat tactgctttc aatcatcccc 4980 tgaatctcac caagcagcag ctctcagagg tggctctgat gaccacatca gtggatgtcc 5040 ttgtgtccat ctgtgtcatc tttgcaatgt ccttcgtccc agccagcttt gtcgtattcc 5100 tgatccagga gcgggtcagc aaagcaaaac acctgcagtt catcagtgga gtgaagcctg 5160 tcatctactg gctctctaat tttgtctggg atatgtgcaa ttacgttgtc cctgccacac 5220 tggtcattat catcttcatc tgcttccagc agaagtccta tgtgtcctcc accaatctgc 5280 ctgtgctagc ccttctactt ttgctgtatg ggtggtcaat cacacctctc atgtacccag 5340 cctcctttgt gttcaagatc cccagcacag cctatgtggt gctcaccagc gtgaacctct 5400 tcattggcat taatggcagc gtggccacct ttgtgctgga gctgttcacc gacaataagc 5460 tgaataatat caatgatatc ctgaagtccg tgttcttgat cttcccacat ttttgcctgg 5520 gacgagggct catcgacatg gtgaaaaacc aggcaatggc tgatgccctg gaaaggtttg 5580 gggagaatcg ctttgtgtca ccattatctt gggacttggt gggacgaaac ctcttcgcca 5640 tggccgtgga aggggtggtg ttcttcctca ttactgttct gatccagtac agattcttca 5700 tcaggcccag acctgtaaat gcaaagctat ctcctctgaa tgatgaagat gaagatgtga 5760 ggcgggaaag acagagaatt cttgatggtg gaggccagaa tgacatctta gaaatcaagg 5820 agttgacgaa gatatataga aggaagcgga agcctgctgt tgacaggatt tgcgtgggca 5880 ttcctcctgg tgagtgcttt gggctcctgg gagttaatgg ggctggaaaa tcatcaactt 5940 tcaagatgtt aacaggagat accactgtta ccagaggaga tgctttcctt aacaaaaata 6000 gtatcttatc aaacatccat gaagtacatc agaacatggg ctactgccct cagtttgatg 6060 ccatcacaga gctgttgact gggagagaac acgtggagtt ctttgccctt ttgagaggag 6120 tcccagagaa agaagttggc aaggttggtg agtgggcgat tcggaaactg ggcctcgtga 6180 agtatggaga aaaatatgct ggtaactata gtggaggcaa caaacgcaag ctctctacag 6240 ccatggcttt gatcggcggg cctcctgtgg tgtttctgga tgaacccacc acaggcatgg 6300 atcccaaagc ccggcggttc ttgtggaatt gtgccctaag tgttgtcaag gaggggagat 6360 cagtagtgct tacatctcat agtatggaag aatgtgaagc tctttgcact aggatggcaa 6420 tcatggtcaa tggaaggttc aggtgccttg gcagtgtcca gcatctaaaa aataggtttg 6480 gagatggtta tacaatagtt gtacgaatag cagggtccaa cccggacctg aagcctgtcc 6540 aggatttctt tggacttgca tttcctggaa gtgttctaaa agagaaacac cggaacatgc 6600 tacaatacca gcttccatct tcattatctt ctctggccag gatattcagc atcctctccc 6660 agagcaaaaa gcgactccac atagaagact actctgtttc tcagacaaca cttgaccaag 6720 tatttgtgaa ctttgccaag gaccaaagtg atgatgacca cttaaaagac ctctcattac 6780 acaaaaacca gacagtagtg gacgttgcag ttctcacatc ttttctacag gatgagaaag 6840 tgaaagaaag ctatgtatga agaatcctgt tcatacgggg tggctgaaag taaagaggaa 6900 ctagactttc ctttgcacca tgtgaagtgt tgtggagaaa agagccagaa gttgatgtgg 6960 gaagaagtaa actggatact gtactgatac tattcaatgc aatgcaattc aatgcaatga 7020 aaacaaaatt ccattacagg ggcagtgcct ttgtagccta tgtcttgtat ggctctcaag 7080 tgaaagactt gaatttagtt ttttacctat acctatgtga aactctatta tggaacccaa 7140 tggacatatg ggtttgaact cacacttttt tttttttttt tgttcctgtg tattctcatt 7200 ggggttgcaa caataattca tcaagtaatc atggccagcg attattgatc aaaatcaaaa 7260 ggtaatgcac atcctcattc actaagccat gccatgccca ggagactggt ttcccggtga 7320 cacatccatt gctggcaatg agtgtgccag agttattagt gccaagtttt tcagaaagtt 7380 tgaagcacca tggtgtgtca tgctcacttt tgtgaaagct gctctgctca gagtctatca 7440 acattgaata tcagttgaca gaatggtgcc atgcgtggct aacatcctgc tttgattccc 7500 tctgataagc tgttctggtg gcagtaacat gcaacaaaaa tgtgggtgtc tccaggcacg 7560 ggaaacttgg ttccattgtt atattgtcct atgcttcgag ccatgggtct acagggtcat 7620 ccttatgaga ctcttaaata tacttagatc ctggtaagag gcaaagaatc aacagccaaa 7680 ctgctggggc tgcaactgct gaagccaggg catgggatta aagagattgt gcgttcaaac 7740 ctagggaagc ctgtgcccat ttgtcctgac tgtctgctaa catggtacac tgcatctcaa 7800 gatgtttatc tgacacaagt gtattatttc tggctttttg aattaatcta gaaaatgaaa 7860 7 16 DNA homo sapien misc_feature (7)...(10) n = a, t, g or c 7 aggtcannnn aggtca 16 8 26 DNA homo sapien 8 agaggcaggt ggatcatttg aggtca 26 9 26 DNA homo sapien 9 ttgaggcggg tgatcacttg aggtca 26 10 26 DNA homo sapiens 10 caaggcgggc agatcacttg aggtta 26 11 26 DNA Homo sapien 11 caaggtgggc agctcacctc aggtca 26 12 24 DNA Homo sapiens misc_feature (1)...(18) n= a, t, g, c, other or none. 12 nnnnnnannt tgacctnntg acct 24 13 24 DNA Homo sapiens 13 ctttgaagcc tgatcatatg acct 24 14 24 DNA Homo sapiens 14 aggctggtct cgaactcctg acct 24 15 24 DNA Homo sapien 15 cttaattggt ggwgctgttg acct 24 16 24 DNA Homo sapien 16 caggatggcg taaactcctg acct 24 17 24 DNA homo sapien 17 aggttggttt cgaactcctg acct 24 18 24 DNA Homo sapein 18 tcaaggtagg agaccttgtg gcct 24 19 10 DNA homo sapien 19 atcaccccac 10 20 20 DNA homo sapien 20 gagatgtgct atgaccccac 20 21 20 DNA homo sapien 21 gtgagcccag atcacaccac 20 22 20 DNA homo sapien 22 tccatccatc cacaccccac 20 23 20 DNA homo sapien 23 cccttttatt aacacctcac 20 24 20 DNA homo sapien 24 gtaagccaag atcatgccac 20 25 20 DNA homo sapien 25 acctcaagtg atcacccgcc 20 26 20 DNA homo sapien 26 ggctcaagcg atcctcccac 20 27 20 DNA homo sapien 27 ccatgattgg atcactgcac 20 28 20 DNA homo sapien 28 gtgagtcgag atcatgccac 20 29 20 DNA homo sapien 29 tgcttttgtt ttccccccac 20 30 20 DNA homo sapien 30 ccgccttccc ctcaccccag 20 31 20 DNA homo sapien 31 accctccacc cccaccccac 20 32 16 DNA homo sapien variation (1)...(11) n at positions 2-9 is a, t, g or c; n at positions 1 and 10 is a or t; n at position 11 is a or g. 32 nnnnnnnnnn nggtca 16 33 18 DNA homo sapien 33 ctgggcaagg atgggtca 18 34 17 DNA homo sapiens 34 tgggcaagga tgggtca 17 35 17 DNA homo sapiens 35 aaaaagcacc aaggtca 17 36 17 DNA homo sapien 36 agaagatgcc agggtca 17 37 17 DNA homo sapiens 37 gaggagatgg agggtca 17 38 26 DNA homo sapiens 38 ccgagcgcag aggttactat cggtca 26 39 26 DNA homo sapien 39 gccaattccc aggtcagaac agacca 26 40 20 DNA homo sapien 40 ggacctgcag ctctccccac 20 41 17 DNA homo sapien 41 aacgcccaag taagtca 17 42 17 DNA homo sapien 42 gagctcgtac taggaca 17 43 17 DNA homo sapien 43 gcagagtcct ggggtca 17 44 18 DNA homo sapien 44 cgcagagtcc tggggtca 18 45 17 DNA homo sapien 45 agccaattcc caggtca 17 46 17 DNA homo sapien 46 acggaccgtt tgggaca 17 47 18 DNA homo sapien 47 cacggaccgt ttgggaca 18 48 19 DNA homo sapien 48 ccacggaccg tttgggaca 19 49 18 DNA homo sapien 49 actagaggcc ttgggtct 18 50 17 DNA homo sapien 50 ctagaggcct tgggtct 17 51 17 DNA homo sapien 51 ccctacccct caggtca 17 52 18 DNA homo sapien 52 tccctacccc tcaggtca 18 53 17 DNA homo sapien misc_feature (2)...(2) n = c or g. 53 gntctgcgcc agggaca 17 54 17 DNA homo sapien misc_feature (12)...(12) n = a, t, g or c. 54 ttttagtgag anggtta 17 55 26 DNA homo sapien 55 tgaggcaggt agatcacttg aggtca 26 56 26 DNA homo sapien 56 cgaggctggc ggatcacctg aggtca 26 57 26 DNA homo sapien 57 aagcctaaca aggttactga aggcca 26 58 26 DNA homo sapien 58 agaggtgggc ggatcacctg aggtca 26 59 24 DNA homo sapien 59 ctcgatttcc tgacctcgtg atcc 24 60 24 DNA homo sapien 60 caaaacattg tgcccttttg aact 24 61 24 DNA homo sapien 61 gcgctagggt tgtcctcatt acct 24 62 24 DNA homo sapien 62 ctcgatttct tgacctcgtg atcc 24 63 20 DNA homo sapien 63 gtgagctgag atcacaccac 20 64 20 DNA homo sapien 64 ttcaaggatg atcaccacat 20 65 20 DNA homo sapien 65 ggctcaagtg atcctcccac 20 66 20 DNA homo sapien 66 gtgagccgag atcgcgccac 20 67 20 DNA homo sapien 67 gtgagttatg atcatgccac 20 68 20 DNA homo sapien 68 ccactgtttg aacaacccac 20 69 20 DNA homo sapien 69 acctcaggtg atccgcccac 20 70 20 DNA homo sapien 70 aaatgtgaca atctccacac 20 71 20 DNA homo sapien 71 aatatagaat atcacctccc 20 72 20 DNA homo sapien 72 ccttttatct accacccaac 20 73 17 DNA homo sapien 73 ccttgttgga tgggtca 17 74 18 DNA homo sapien 74 gccttgttgg atgggtca 18 75 17 DNA homo sapien 75 ttgctgtgag tgggtca 17 76 17 DNA homo sapien 76 gccttgcaga agggtca 17 77 18 DNA homo sapien 77 ggccttgcag aagggtca 18 78 18 DNA homo sapien 78 aattaagctg atgggtca 18 79 17 DNA homo sapien 79 attaagctga tgggtca 17 80 18 DNA homo sapien 80 aggtgctaac tagggtca 18 81 17 DNA homo sapien 81 ggtgctaact agggtca 17 82 17 DNA homo sapien 82 atgggatgac tgggtca 17 83 17 DNA homo sapien 83 tctccatgcc aaggtca 17 84 23 DNA homo sapien 84 ttgaggacat gcggtacgtc tgg 23 85 23 DNA homo sapien 85 ttgaggacat gtggtacgtc tgg 23 86 21 DNA homo sapien 86 gcctacttgc aggatgtggt g 21 87 21 DNA homo sapien 87 gcctacttgc gggatgtggt g 21 88 23 DNA homo sapien 88 cctcattcct cttcttgtga gcg 23 89 20 DNA homo sapien 89 cctcattcct cttgtgagcg 20 90 21 DNA homo sapien 90 aaaagtctac cgagatggga t 21 91 21 DNA homo sapien 91 aaaagtctac tgagatggga t 21 92 21 DNA homo sapien 92 ggccagatca cctccttcct g 21 93 21 DNA homo sapien 93 ggccagatca tctccttcct g 21 94 21 DNA homo sapien 94 acacaccaca tggatgaagc g 21 95 21 DNA homo sapien 95 acacaccaca cggatgaagc g 21 96 21 DNA homo sapien 96 cctggaagaa gtaagttaag t 21 97 21 DNA homo sapien 97 cctggaagaa ctaagttaag t 21 98 21 DNA homo sapien 98 gctgcctgtg tgtcccccag g 21 99 21 DNA homo sapien 99 gctgcctgtg cgtcccccag g 21 100 22 DNA homo sapien 100 tagccattat ggaattactg ct 22 101 21 DNA homo sapien 101 tagccattat caattactgc t 21 102 26 DNA homo sapien 102 gatgaagatg aagatgtgag gcggga 26 103 20 DNA homo sapien 103 gatgaagatg tgaggcggga 20 104 21 DNA homo sapien 104 aatagttgta cgaatagcag g 21 105 21 DNA homo sapien 105 aatagttgta tgaatagcag g 21 106 20 DNA homo sapien 106 atagttgtac gaatagcagg 20 107 19 DNA homo sapien 107 atagttgtag aatagcagg 19 108 20 DNA homo sapien 108 gggtccaacc cggacctgaa 20 109 20 DNA homo sapien 109 gggtccaacc tggacctgaa 20 110 21 DNA homo sapien 110 cattatcttc tctggccagg a 21 111 20 DNA homo sapien 111 cattatcttt ttggccagga 20 112 20 DNA homo sapien 112 ggaactagtc ccggcaaaaa 20 113 20 DNA homo sapien 113 ggaactagtc tcggcaaaaa 20 114 17 DNA homo sapien 114 ccgggacccg cagagcc 17 115 17 DNA homo sapien 115 ccgggaccgg cagagcc 17 116 20 DNA homo sapien 116 accagccacg gcgtccctgc 20 117 21 DNA homo sapien 117 accagccacg ggcgtccctg c 21 118 21 DNA homo sapien 118 acacgctggg ggtgctggct g 21 119 21 DNA homo sapien 119 acacgctggg cgtgctggct g 21 120 21 DNA homo sapien 120 ctgggttcct gtatcacaac c 21 121 21 DNA homo sapien 121 ctgggttcct atatcacaac c 21 122 21 DNA homo sapien 122 ggcctaccaa gggagaaact g 21 123 21 DNA homo sapien 123 ggcctaccaa aggagaaact g 21 124 21 DNA homo sapien 124 gcgggcatcc cgagggaggg g 21 125 21 DNA homo sapien 125 gcgggcatcc tgagggaggg g 21 126 21 DNA homo sapien 126 agggaggggg gctgaagatc a 21 127 21 DNA homo sapien 127 agggaggggg actgaagatc a 21 128 21 DNA homo sapien 128 tgactccagg tgaacaagac c 21 129 21 DNA homo sapien 129 tgactccagg cgaacaagac c 21 130 21 DNA homo sapien 130 gcaggactac gtgggcttca c 21 131 21 DNA homo sapien 131 gcaggactac atgggcttca c 21 132 21 DNA homo sapien 132 cgtgggcttc acactcaaga t 21 133 21 DNA homo sapien 133 cgtgggcttc ccactcaaga t 21 134 21 DNA homo sapien 134 tcacactcaa gatcttcgct g 21 135 21 DNA homo sapien 135 tcacactcaa catcttcgct g 21 136 17 DNA homo sapien 136 ccacttcggt ctccatg 17 137 17 DNA homo sapien 137 ccacttcgat ctccatg 17 138 18 DNA homo sapien 138 gaagagaata tcagaaag 18 139 18 DNA homo sapien 139 gaagagaatg tcagaaag 18 140 21 DNA homo sapien 140 gatctaaggt tgtcattctg g 21 141 21 DNA homo sapien 141 gatctaaggt ggtcattctg g 21 142 20 DNA homo sapien 142 gcgaccatga gagtgacacg 20 143 20 DNA homo sapien 143 gcgaccatga cagtgacacg 20 144 21 DNA homo sapien 144 ctggacacca gaaataatgt c 21 145 21 DNA homo sapien 145 ctggacacca aaaataatgt c 21 146 21 DNA homo sapien 146 tcctatgtgt cctccaccaa t 21 147 21 DNA homo sapien 147 tcctatgtgt gctccaccaa t 21 148 21 DNA homo sapien 148 cagggtccaa cccggacctg a 21 149 21 DNA homo sapien 149 cagggtccaa tccggacctg a 21 150 22 DNA homo sapien 150 cgggggaagg gcacgcagac cg 22 151 22 DNA homo sapien 151 cgggggaagg ggacgcagac cg 22 152 17 DNA homo sapien 152 ggccgggaac ggggcgg 17 153 17 DNA homo sapien 153 ggccgggaag ggggcgg 17 154 21 DNA homo sapien 154 agtatccctt gttcacgaga a 21 155 25 DNA homo sapien 155 agtatccctc ccttgttcac gagaa 25 156 21 DNA homo sapien 156 gtgacaccca acggagtagg g 21 157 21 DNA homo sapien 157 gtgacaccca gcggagtagg g 21 158 25 DNA homo sapien 158 tatgtgctga ccatgggagc ttgtt 25 159 25 DNA homo sapien 159 tatgtgctga ccgtgggagc ttgtt 25 160 25 DNA homo sapien 160 cctccgcctg ccaggttcag cgatt 25 161 25 DNA homo sapien 161 cctccgcctg ccgggttcag cgatt 25 162 21 DNA homo sapien 162 gaaaattagt atgtaaggaa g 21 163 21 DNA homo sapien 163 gaaaattagt ctgtaaggaa g 21 164 22 DNA homo sapien 164 cattttctta gaaaagagag gt 22 165 22 DNA homo sapien 165 cattttctta gagaagagag gt 22 166 20 DNA homo sapien misc_feature (11)...(11) n = g or t 166 tttaaagggg ntgattagga 20 167 22 DNA homo sapien misc_feature (11)...(11) n = g or t 167 gaagaaattt ntttttttga tt 22 168 19 DNA homo sapien 168 tctgtcccca tccctgacg 19 169 20 DNA homo sapien 169 tctgtcccca atccctgacg 20 170 20 DNA homo sapien 170 aggagccaaa cgctcattgt 20 171 21 DNA homo sapien 171 aggagccaaa gcgctcattg t 21 172 21 DNA homo sapien 172 aagccactgt ttttaaccag t 21 173 21 DNA homo sapien 173 aagccactgt atttaaccag t 21 174 22 DNA homo sapien 174 gctccctcta gcatgcaggc tc 22 175 22 DNA homo sapien 175 gctccctcta gtatgcaggc tc 22 176 21 DNA homo sapien 176 ttgcctgttt ctcacagagc c 21 177 19 DNA homo sapien 177 ttgcctgttt ctcagagcc 19 178 21 DNA homo sapien misc_feature (10)...(10) n = c or g 178 gcgcagtgcn ctgtgtcctt a 21 179 23 DNA homo sapien misc_feature (12)...(12) n = g or t 179 ctcttctgtt ancacagaag aga 23 180 21 DNA homo sapien misc_feature (11)...(11) n = a or g 180 cattctaggg ntcatagcca t 21 181 22 DNA homo sapien misc_feature (11)...(11) n = g or t 181 aagtacagtg ngaggaacag cg 22 182 22 DNA homo sapien misc_feature (12)...(12) n = a or g 182 attcctaaaa antagaaatg ca 22 183 23 DNA homo sapien 183 tttctgtttc aattcttgtc tat 23 184 23 DNA homo sapien 184 tttctgtttc agttcttgtc tat 23 185 21 DNA homo sapien 185 ggcccctgcc ttattattac t 21 186 21 DNA homo sapien 186 ggcccctgcc gtattattac t 21 187 21 DNA homo sapien 187 cactgtctgg gttttaatgt c 21 188 21 DNA homo sapien 188 cactgtctgg cttttaatgt c 21 189 22 DNA homo sapien misc_feature (11)...(11) n = a or g 189 tgagagaatt ncttgaaccc gg 22 190 21 DNA homo sapien misc_feature (11)...(11) n = c or t 190 tttgctgaaa naatcactga c 21 191 22 DNA homo sapien misc_feature (11)...(11) n = c or t 191 aacctcagtt ncctcatctg tg 22 192 21 DNA homo sapien 192 aagaagtggc ttgtattttg c 21 193 21 DNA homo sapien 193 aagaagtggc ctgtattttg c 21 194 23 DNA homo sapien misc_feature (11)...(11) n = a or g 194 aactgatttg nttggtatag ctg 23 195 20 DNA homo sapien 195 aataaagata atttctttgg 20 196 20 DNA homo sapien 196 aataaagata gtttctttgg 20 197 22 DNA homo sapien 197 ttcctgcccc gacactcccg cc 22 198 22 DNA homo sapien 198 ttcctgcccc cacactcccg cc 22 199 22 DNA homo sapien misc_feature (11)...(11) n = a or g 199 ttgggaggct naggcaggag aa 22 200 22 DNA homo sapien 200 tgtcagctgt tactggaagt gg 22 201 22 DNA homo sapien 201 tgtcagctgc tgctggaagt gg 22 202 21 DNA homo sapien 202 aggagctggc cgaagccaca a 21 203 21 DNA homo sapien 203 aggagctggc tgaagccaca a 21 204 21 DNA homo sapien 204 aatgatgcca ccaaacaaat g 21 205 21 DNA homo sapien 205 aatgatgcca tcaaacaaat g 21 206 21 DNA homo sapien 206 gaggtggctc cgatgaccac a 21 207 21 DNA homo sapien 207 gaggtggctc tgatgaccac a 21 208 21 DNA homo sapien 208 ttccttaaca gaaatagtat c 21 209 21 DNA homo sapien 209 ttccttaaca aaaatagtat c 21 210 21 DNA homo sapien 210 ggaagtgttc caaaagagaa a 21 211 34 DNA Artificial Sequence Synthetic primer 211 gtatttttgc aaggctacca gttacatttg acaa 34 212 28 DNA Artificial Sequence Synthetic primer 212 gattggcttc aggatgtcca tgttggaa 28 213 20 DNA Artificial Sequence Synthetic primer 213 gctgctgtga tggggtatct 20 214 20 DNA Artificial Sequence Synthetic primer 214 acctcactca cacctgggaa 20 215 20 DNA Artificial Sequence Synthetic primer 215 caagtgagtg cttgggattg 20 216 21 DNA Artificial Sequence Synthetic primer 216 tgcttttatt cagggactcc a 21 217 25 DNA Artificial Sequence Synthetic primer 217 gtgatcccag cgtggtgttt gtctt 25 218 37 DNA Artificial Sequence Synthetic primer 218 gaaaggccag aggtactcac agcgaagatc ttgaggg 37 219 21 DNA Artificial Sequence Synthetic primer 219 tcgttttatt cagggactcc a 21 220 20 DNA Artificial Sequence Synthetic primer 220 caagtgagtg cttgggattg 20 221 20 DNA Artificial Sequence Synthetic primer 221 cccatgcact gcagagattc 20 222 21 DNA Artificial Sequence Synthetic primer 222 gcaaattcaa atttctccag g 21 223 35 DNA Artificial Sequence Synthetic primer 223 gagaagagcc accctggttc caaccagaag aggat 35 224 19 DNA Artificial Sequence Synthetic primer 224 aaggcaggag acatcgctt 19 225 36 DNA Artificial Sequence Synthetic primer 225 gagcagttct gatgctggcc tgggcagcga ccacga 36 226 20 DNA Artificial Sequence Synthetic primer 226 tctgcacctc tcctcctctg 20 227 35 DNA Artificial Sequence Synthetic primer 227 cagcttggga agatttatga caggactgga cacga 35 228 18 DNA Artificial Sequence Synthetic primer 228 atgcccctgc caacttac 18 229 27 DNA Artificial Sequence Synthetic primer 229 gtgcaattac gttgtccctg ccacact 27 230 30 DNA Artificial Sequence Synthetic primer 230 ccatacagca aaagtagaag ggctagcaca 30 231 16 DNA homo sapien misc_feature (7)...(10) n = a, t, g or c 231 agatcannnn aggtca 16 232 16 DNA Homo sapien 232 agatcacttg aggtca 16 233 22 DNA Artificial Sequence Synthetic Primer 233 cagcgcttcc cgcgcgtctt ag 22 234 24 DNA Artificial Sequence Synthetic Primer 234 ccactcactc tcgtccgcaa ttac 24 235 38 DNA Artificial Sequence Synthetic Primer 235 ctgctgagtg actgaactac ataaacagag gccgggta 38 236 24 DNA Artificial Sequence Synthetic Primer 236 ccactcactc tcgtccgcaa ttac 24 237 22 DNA Artificial Sequence Synthetic Primer 237 cagcgcttcc cgcgcgtctt ag 22 238 24 DNA Artificial Sequence Synthetic Primer 238 ccactcactc tcgtccgcaa ttac 24 239 20 DNA Artificial Sequence Synthetic Primer 239 ctggctttct gctgagtgac 20 240 19 DNA Artificial Sequence Synthetic Primer 240 gatcaaagtc cccgaaacc 19 241 30 DNA Artificial Sequence Synthetic Primer 241 actcagttgt ataacccact gaaaatgagt 30 242 23 DNA Artificial Sequence Synthetic Primer 242 ttctatagat gttatcatct ggg 23 243 30 DNA Artificial Sequence Synthetic Primer 243 actcagttgt ataacccact gaaaatgagt 30 244 23 DNA Artificial Sequence Synthetic Primer 244 ttctatagat gttatcatct ggg 23 245 20 DNA Artificial Sequence Synthetic Primer 245 tcatctaagg cacgttgtgg 20 246 20 DNA Artificial Sequence Synthetic Primer 246 cctcaagcct ggagtgactt 20 247 20 DNA Artificial Sequence Synthetic Primer 247 atggcaaaca gtcctccaag 20 248 20 DNA Artificial Sequence Synthetic Primer 248 accctagcgc tgtgtctctg 20 249 20 DNA Artificial Sequence Synthetic Primer 249 atggcaaaca gtcctccaag 20 250 20 DNA Artificial Sequence Synthetic Primer 250 accctagcgc tgtgtctctg 20 251 20 DNA Artificial Sequence Synthetic Primer 251 tgtgtgtcct cccttccatt 20 252 20 DNA Artificial Sequence Synthetic Primer 252 cttggaggac tgtttgccat 20 253 38 DNA Artificial Sequence Synthetic Primer 253 cccctcctgc tttatctttc agttaatgac cagccccg 38 254 20 DNA Artificial Sequence Synthetic Primer 254 atccccaact caaaaccaca 20 255 37 DNA Artificial Sequence Synthetic Primer 255 gccgctgcct tccagggctc ccgagccaca cgctgcg 37 256 20 DNA Artificial Sequence Synthetic Primer 256 atccccaact caaaaccaca 20 

What is claimed is:
 1. A substantially pure nucleic acid molecule comprising a region that is substantially identical to at least fifty contiguous nucleotides of nucleotides 5854 to 6694, 7756 to 8318, 10479 to 10825, 15214 to 16068, 21636 to 22111, 27898 to 28721, 32951 to 33743, 36065 to 36847, 39730 to 40577, 4543 to 5287, or 45081 to 55639 of SEQ ID NO:
 1. 2. A substantially pure nucleic acid molecule comprising a region that is substantially identical to nucleotides 1 to 28,707 of SEQ ID NO:
 1. 3. A substantially pure nucleic acid molecule comprising a region that is substantially identical to nucleotides 29,011 to 53,228 of SEQ ID NO:
 1. 4. A cell expressing the nucleic acid molecule of claim
 1. 5. A non-human mammal expressing the nucleic acid molecule of claim
 1. 6. A method for predicting a person's response to a triglyceride-lowering drug, comprising determining whether the person has a polymorphism in an ABC1 gene, promoter, or regulatory sequence that alters the person's response to said drug.
 7. A method of determining risk of developing a cardiovascular disease in a mammal, comprising determining the presence of a polymorphism in the amino acid sequence of an ABCA1 polypeptide of a mammal wherein said ABCA1 polymorphism indicates risk of developing a cardiovascular disease and wherein said ABCA1 polymorphism is a member selected from the group consisting of R587W, 2145Stop, P2150L, 2203Stop, V399A, Y774P, V8251, 1883M, E1172D and R1587K.
 8. The method of claim 7 wherein said mammal is a human being.
 9. The method of claim 8, wherein said polymorphism is in the amino acid sequence of SEQ ID NO:
 5. 10. The method of claim 7, wherein more than one polymorphism in said ABCA1 protein is determined.
 11. The method of claim 7, wherein said polymorphism is determined in the gene encoding said ABCA1 protein.
 12. The method of claim 11, wherein said polymorphism in the gene is heterozygous.
 13. The method of claim 11, wherein said polymorphism in the gene is homozygous.
 14. The method of claim 7, wherein said cardiovascular disease is a member of the group consisting of coronary artery disease (CAD), cerebrovascular disease, coronary restenosis, atherosclerosis and peripheral vascular disease.
 15. The method of claim 14, wherein said cardiovascular disease is coronary artery disease (CAD).
 16. The method of claim 14 wherein said cardiovascular disease is atherosclerosis.
 17. A method of determining risk of developing low plasma HDL levels in a mammal, comprising determining the presence of a polymorphism in the amino acid sequence of an ABCA1 polypeptide of a mammal wherein said ABCA1 polymorphism indicates risk of developing low HDL levels and wherein said ABCA1 polymorphism is a member selected from the group consisting of R587W, 2145Stop, P2150L, 2203Stop, V399A, Y774P, V825I, I883M, E1172D and R1587K.
 18. The method of claim 17, wherein said mammal is a human patient.
 19. The method of claim 18, wherein said polymorphism is in the amino acid sequence of SEQ ID NO:
 1. 20. The method of claim 17, wherein said determination results in a finding of more than one polymorphism in said ABCA1 protein.
 21. The method of claim 17, wherein said polymorphism is determined in the gene encoding said ABCA1 protein.
 22. The method of claim 21, wherein said polymorphism in the gene is heterozygous.
 23. The method of claim 21, wherein said polymorphism in the gene is homozygous.
 24. A method of determining risk of developing elevated plasma triglyceride levels in a mammal, comprising determining the presence of a polymorphism in the amino acid sequence of an ABCA1 polypeptide of a mammal wherein the presence of said ABCA1 polymorphism indicates risk of developing elevated triglyceride levels and said wherein ABCA1 polymorphism is a member selected from the group consisting of R587W, 2145Stop, P2150L, 2203Stop, V399A, Y774P, V8251, 1883M, E1172D and R1587K.
 25. The method of claim 24, wherein said mammal is a human patient.
 26. The method of claim 24, wherein more than one polymorphism in said ABCA1 protein.
 27. The method of claim 24, wherein said polymorphism is determined in the gene encoding said ABCA1 protein.
 28. The method of claim 27, wherein said polymorphism in the gene is heterozygous.
 29. The method of claim 27, wherein said polymorphism in the gene is homozygous. 